Mothers’ milk


The safe use of medicines during breastfeeding is not an easy topic to study, but new parents 
deserve better information on the risks and benefits. 


When Janet Woodcock first started to practise medicine
about 40 years ago, she quickly realized that her training
had not equipped her to deal with a common dilemma.
New mothers were being encouraged to breastfeed their children, 
but was it safe to do so if they were taking medication? “I had never 
received one word of information on that situation,” says Woodcock, 
who now heads the US Center for Drug Evaluation and Research at 
the Food and Drug Administration (FDA). 

Woodcock’s patients, she says, were “frantic” to do the best for their
babies. But in the absence of data on whether and how a medicine 
could affect their newborn, mothers were often forced to decide 
between their own health and their child’s. The prevailing medical 
advice — then and now — was, in case of doubt, to stop breastfeeding. 

The situation has improved, she said at a workshop on medications 
and breastfeeding convened by the FDA late last month — but not 
nearly enough. Almost 90% of breastfeeding mothers in the United 
States take a medicine of some sort. For many of those drugs — includ- 
ing commonly used medicines to treat high cholesterol and diabetes 
— doctors still don’t know how to counsel their patients. At the work- 
shop, researchers illustrated how little research is done to answer those 
questions: a search of grants issued by the US National Institutes of 
Health (NIH) on the topic shows only a handful of studies, and most 
focus on HIV medicines. 

The dearth of research comes amid renewed massive public-health 
pushes across the world to encourage mothers to breastfeed. Breastfeed- 
ing has been linked to fewer infections and less time in the paediatri- 
cian’s office, saving parents anxiety and health systems cash. The need is 
particularly acute in countries where money and clean water to buy and 
prepare baby formula are limited. More than a decade into the twenty- 
first century, whether the medicines a breastfeeding mother takes are 
safe is a question that demands more attention. 

It is undeniably difficult to conduct most clinical studies of infants. 
There are logistical challenges: an exhausted mother may not be keen 
to attend extra medical visits, and may not want to divulge the medi- 
cines she has chosen to take while breastfeeding. There are ethical 
challenges: clinical trials involving babies are fraught with ques- 
tions about informed consent, for example. And there are financial 
challenges, too. 

These problems have received little public attention, yet the 
barriers can be surmounted. At the FDA workshop, several researchers 
presented their success stories and lessons learned. Seemingly small 
measures, even changing a nappy or rocking a baby while a mother 
visits a clinic, can encourage women to make the effort to participate 
in a study. Ethical questions can be addressed through careful study
design, and by paying attention to the benefits of the extra monitoring 
for both individual babies and for mothers. And in 2014, the FDA took 
a step towards raising the visibility of the matter by improving drug 
labels to better display what is known, and unknown, about the
safety of a given drug for breastfeeding mothers and their children.
Some researchers are already gathering data and building resources.
Researchers at the University of California, San Diego, for instance,
have launched the Mommy’s Milk Human Milk Research Biorepository,
the first of its kind, they say.

At first glance, it might not seem like sexy science for a basic
researcher: the details of how particular drugs are metabolized are more
the province of drug developers. And industry certainly has a
responsibility to address open questions around the medicines that it
produces. But basic researchers can contribute, too. Fascinating
research avenues involve developmental biology, physiology and the
microbiome, all of which could provide relevant information and
possibly even advance fields in a fundamental way.
Funders such as the NIH have taken laudable steps to address women’s
health issues at the level of basic research, by ensuring that animal
studies include females when possible and relevant. More researchers and
funders should build on that momentum and address the impact of
medicines on breastfeeding mothers and their children.


Market forces 


A European plan to commercialize quantum 
technologies needs a bold goal. 


Nobody ever went broke underestimating the intelligence
of the American public, goes the famous line by the US
editor Henry Louis Mencken. It’s actually a paraphrase, but
the meaning is clear: to make money, it is safe to assume that nobody 
knows anything. 

By rights, then, quantum physics should be extremely profitable. 
The subject is often used as shorthand for knowledge that is reserved 
for a small intellectual elite, with everyone else left scratching their 
heads. As Canadian Prime Minister Justin Trudeau showed last 
month, the quantum world is so weird that to mount even a half- 
decent explanation of its basic principles can bring praise and plaudits. 

Can this widespread ignorance — the puzzlement at how cats can 
be both alive and dead, or how particles can exist in two places at once 
— be capitalized on? The European Commission believes that it can. 
Next week, it will release a plan for a continent-wide drive to turn the 
mysteries of quantum physics into hard cash. 

This plan, called the European Quantum Manifesto, will be officially 
released in the Dutch town of Delft, where the commission hopes a 
revolution will be born. Eyeing China, Australia, Canada and other
countries that have invested huge sums of money in quantum technol- 
ogy, Europe does not want to miss out. With €1 billion (US$1.1 billion) 
of funding, scientists and businesses will be expected to translate quan- 
tum research into quantum products to create “a more sustainable, more 
productive, more entrepreneurial and more secure European Union”.

These are great expectations. Europe is no doubt encouraged by 
the various quantum technologies that have matured in recent years. 
Quantum sensors, for example, can achieve high sensitivity and reso- 
lution through quantum superposition or entanglement, outperform- 
ing classical sensors in various imaging applications. Strategic use of 
funds could indeed take quantum sensors to market in a few years. 

But for most quantum technologies, the path to commerciali- 
zation is much longer and more contrived. The arguable peak of 
quantum technologies — the construction of a universal quantum 
computer — is decades, and billions of euros of targeted investment, 
away. But it promises perhaps the greatest gains: substantially greater 
power for key computations, such as simulations of chemical reactions 
and — maybe — machine learning. 

Revolutions happen through popular uprising and not through 
carefully directed government investment. At some point, investors, 
entrepreneurs and academics are supposed to conspire on this revo- 
lution without directives from above. Hence the European Quantum 
Manifesto seeks to mobilize a broad base of quantum technologists. 
Specifically, it plans an environment in which small, high-potential 
quantum-tech businesses can thrive. 

Given that a large majority of start-up firms fail, how is this plan 
supposed to work in the risky and unproven quantum-technology 
business? Predicting the likely outcome of the European Commission's 
plan is as hard as determining whether Schrödinger’s cat is dead or
alive without opening its box. 

Can we peek inside the box to get some insights on how this
commercial future might unfold? Nature has designed an experiment to try.
The project (see go.nature.com/53iiw6) trained seven young quantum
physicists to conceive and evaluate business ideas in quantum
technologies. The project culminated in a presentation day last week at
Nature’s London office, where the physicists’ ideas were scrutinized by
a panel of experienced entrepreneurs and leaders in quantum technologies.
A PhD student from University College London invented a
quantum-inspired accelerometer with a relatively safe and clear route to
market. And two postdocs from the University of New South Wales in
Sydney, Australia, have the ambition to outshine Google and IBM and
build a universal quantum computer based on silicon qubits.

Two of the five ideas that were presented (an invention that permits
quantum computers to be linked, and a start-up that will design
quantum machine-learning algorithms) set out to depend on the few
companies and groups who have already invested huge sums of money
to try to build quantum-computing hardware. Both ideas are betting
on being able to sell their products to only a few customers. It sounds
like a risky strategy, but it might indicate a way to create and sustain
the necessary critical mass of start-ups that the European Quantum
Manifesto is aiming for. Focusing investment on one high-risk, high-gain
goal, such as a universal quantum computer, could create a string
of start-ups that each specialize in one integral component or aspect.
Still, it is unlikely that Europe's quantum-technology initiative will 
take this route. Given the many scientific goals in the manifesto, the 
authors seem to hope that the plan will have its own quantum proper- 
ties and be able to address all the goals simultaneously. That looks like 
a mistake. It would be a missed opportunity if the quantum world that
the commission hopes to create is hamstrung by the small steps and 
endless compromise that haunt other European projects. The initiative 
needs a clear and a bold goal. This is one project that should not have 
to be in several places at once.


Smoke out 


Scientists should unite over electronic-cigarette 
regulation, or big tobacco will step in. 


Six million people die every year as a result of tobacco smoking,
according to an estimate by the World Health Organization. It is
a number worth keeping in mind as the scientific disputes over
electronic cigarettes continue to smoulder.

The US Food and Drug Administration last week announced a 
“historic rule” that gives it the right to regulate e-cigarettes — which 
vaporize nicotine — as it does tobacco products. Nearly all e-cigarettes 
will now have to go through an approval process, with sales to young 
people prohibited, and health warnings included on packaging and 
advertisements. 

Sylvia Burwell, the US Secretary of Health and Human Services, 
noted that e-cigarette use is shooting up among young people in the 
United States, “creating a new generation of Americans who are at risk 
of addiction”, even as cigarette smoking continues to decline.

Some states are already ahead of federal law — earlier this month, 
California defined e-cigarettes as tobacco products, with all that that 
entails. The European Union is also set to take a tougher stance. An 
EU-wide directive that comes into force this year on tobacco products 
will control nicotine content. 

These ‘vaping’ devices have split researchers. Some see a route to end 
the tobacco scourge. Conventional medicine provides few escapes from 
nicotine addiction, and the speed at which smokers embrace electronic 
systems seems to be a blessing. If the world’s smokers switched from 
burning to vaping, that figure of six million deaths would fall.

But other scientists see problems. They fear that electronic devices 
subvert the message that smoking is bad, and offer people a nicotine 
fix in places where cigarettes have long been excluded. They fear a new 
age of nicotine, and that the six-million figure will rise. 

This difference of opinion has spilled messily over into the research 
arena. Published studies are ruthlessly spun or picked apart by oppos- 
ing sides. Sometimes the fight happens even before publication, with 
journalists sent quotes under embargo that critique claims and conclu- 
sions before they are publicly available. 

Both sides are acting in good faith, but their arguments and increas- 
ingly entrenched positions frequently generate more heat than light. 
To progress, researchers on both sides must establish what evidence 
should be gathered to answer the central question: how can e-cigarette 
use and regulation lead to the largest possible reduction in deaths from 
tobacco? As part of this process, they should identify key data that, if 
forthcoming, would change their current view. 

There is good reason for researchers to come together on this, 
and quickly. Conventional tobacco firms are grabbing an increas- 
ingly large share of the e-cigarette market. This should concern 
everyone — and focus minds. Few industries have historically been 
quite so willing to dissemble, and to market products with so few 
benefits and so many harms. 

Researchers should remain focused on the enemy that needs to be 
fought — the horrific harm caused by tobacco. Disputes are part of 
science. They must be conducted in the open, and no researcher — and 
no piece of research — can be immune from 
criticism. But the tobacco-science community 
must find a way forward. It is not hyperbole to 
say that millions of lives are at stake. Six million 
of them are, every year.
WORLD VIEW A personal take on events


The pressure to publish
pushes down quality


Scientists must publish less, says Daniel Sarewitz, or good research will be
swamped by the ever-increasing volume of poor work.


I am pleased to announce that as of the middle of April, my Elsevier
publications had received 30,752 page views and 2,025 citations.
I got these numbers in a promotional e-mail from Elsevier, and
although I’m not sure what they mean, I presume that it would be
even better to have even bigger numbers.

Indeed, the widespread availability of bibliometric data from
sources such as Elsevier, Google Scholar and Thomson Reuters ISI
makes it easy for scientists (with their employers looking over their
shoulders) to obsess about their productivity and impact, and to
compare their numbers with those of other scientists.

And if more is good, then the trends for science are favourable.
The number of publications continues to grow exponentially; it was
already approaching two million per year by 2012. More importantly,
and contrary to common mythology, most papers do get cited. Indeed,
more papers, from more journals, over longer periods of time, are
being cited more often. One likely reason for rising citations is the
incredible search capabilities that the web now affords. This would
seem to be good news.

But what if more is bad? In 1963, the physicist and historian of
science Derek de Solla Price looked at growth trends in the research
enterprise and saw the threat of “scientific doomsday”. The number of
scientists and publications had been growing exponentially for 250
years, and Price realized that the trend was unsustainable. Within a
couple of generations, he said, it would lead to a world in which “we
should have two scientists for every man, woman, child, and dog in the
population”. Price was also an elitist who believed that quality could
not be maintained amid such growth. He showed that scientific eminence
was concentrated in a very small percentage of researchers, and that
the number of leading scientists would therefore grow much more slowly
than the number of merely good ones, and that would yield “an even
greater preponderance of manpower able to write scientific papers, but
not able to write distinguished ones”.

The quality problem has reared its head in ways that Price could not
have anticipated. Mainstream scientific leaders increasingly accept
that large bodies of published research are unreliable. But what seems
to have escaped general notice is a destructive feedback between the
production of poor-quality science, the responsibility to cite previous
work and the compulsion to publish.

The quality problem has been widely recognized in cancer science, in
which many cell lines used for research turn out to be contaminated.
For example, a breast-cancer cell line used in more than 1,000
published studies actually
turned out to have been a melanoma cell line. The average biomedical
research paper gets cited between 10 and 20 times in 5 years, and 
as many as one-third of all cell lines used in research are thought 
to be contaminated, so the arithmetic is easy enough to do: by one 
estimate, 10,000 published papers a year cite work based on contami- 
nated cancer cell lines. Metastasis has spread to the cancer literature. 

Similar negative feedbacks occur in other areas of research. 
Pervasive quality problems have been exposed for rodent studies 
of neurological diseases, biomarkers for cancer and other diseases, 
and experimental psychology, amid the publication of thousands of 
papers. 

So yes, the web makes it much more efficient to identify relevant 
published studies, but it also makes it that much easier to troll for 
supporting papers, whether or not they are any 
good. No wonder citation rates are going up. 

That problem is likely to be worse in policy- 
relevant fields such as nutrition, education, epi- 
demiology and economics, in which the science 
is often uncertain and the societal stakes can be 
high. The never-ending debates about the health 
effects of dietary salt, or how to structure foreign 
aid, or measure ecosystem services, are typical 
of areas in which copious peer-reviewed support 
can be found for whatever position one wants 
to take — a condition that then justifies calls for 
still more research. 

More than 50 years ago, Price predicted that 
the scientific enterprise would soon have to go 
through a transition from exponential growth 
to “something radically different”, unknown and
potentially threatening. Today, the interrelated 
problems of scientific quantity and quality are a frightening mani- 
festation of what he foresaw. It seems extraordinarily unlikely that 
these problems will be resolved through the home remedies of better 
statistics and lab practice, as important as they may be. Rather, they 
would seem — and this is what Price believed — to announce that 
the enterprise of science is evolving towards something different and 
as yet only dimly seen. 

Current trajectories threaten science with drowning in the noise of 
its own rising productivity, a future that Price described as “senility”. 
Avoiding this destiny will, in part, require much more selective pub- 
lication. Rising quality can thus emerge from declining scientific 
efficiency and productivity. We can start by publishing less, and less 
often, whatever the promotional e-mails promise us.


Daniel Sarewitz is co-director of the Consortium for Science, 
Policy and Outcomes at Arizona State University, and is based in 
Washington DC. 

e-mail: daniel.sarewitz@asu.edu 


12 MAY 2016 | VOL 533 | NATURE | 147 


© 2016 Macmillan Publishers Limited. All rights reserved. 


RESEARCH HIGHLIGHT 


Zika shrinks ‘mini 
brains’ in culture 


The Zika virus may trigger an 
immune response that causes 
developing brain cells to stop 
dividing and self-destruct. 

The link between Zika 
infection and the birth of 
babies with abnormally small 
heads, or microcephaly, has 
grown stronger, but it is still 
not clear how the virus attacks 
developing brains. Tariq Rana 
at the University of California, 
San Diego, and his team grew 
cerebral organoids — 3D 
structures that model the 
developing brain — from 
human embryonic stem cells 
and then infected them with 
Zika. Over a 5-day period, 
uninfected organoids grew by 
22.6%, whereas those exposed 
to Zika shrank by 16%. 

Zika infection boosted the 
activity of a pathogen-sensing 
gene, TLR3, which has been 
linked to brain inflammation 
and degeneration. Blocking 
the TLR3 protein in infected 
organoids lessened the damage 
caused by the virus. 

Cell Stem Cell http://doi.org/bgrx 
(2016) 


Second ‘skin’
turns back time


A polymer film that sticks
to human skin reduces the
appearance of wrinkles and
bags under the eyes.

Robert Langer at the
Massachusetts Institute of
Technology in Cambridge
and his colleagues designed a
polysiloxane-based film that
is applied to and cured on the
skin. The transparent film has
similar mechanical properties
to skin, allowing it to conform
to the surface. In small studies
with human volunteers, the
researchers showed that the
film reshapes the skin, making
bags under the eyes look less
puffy and reducing wrinkling.
The film was made from
reagents that are considered to
be safe for the skin. It could be
used cosmetically or in wound
dressings, the authors say.
Nature Mater. http://dx.doi.org/10.1038/nmat4635 (2016)


PLANETARY SCIENCE


Solar wind hits Pluto hard


The solar wind is diverted by Pluto, suggesting
that, like some larger planets, the dwarf planet
has a shield against the stream of energized
particles emanating from the Sun.

Before NASA’s New Horizons spacecraft
visited the dwarf planet (pictured) in 2015,
most scientists thought that Pluto interacted
with the solar wind in the same way as a comet
does. Comets lack protection from the wind,
which diffuses around the cometary surface.
But in analysing data from the spacecraft,
David McComas at Princeton University in
New Jersey and his colleagues identified a
‘Plutopause’, a region where Pluto’s tenuous
atmosphere shields the dwarf planet from the
solar wind.

The Plutopause is relatively small and well
defined, much like the solar-wind boundaries
around Mars and Venus. Even though Pluto is
small, it still exerts enough gravitational pull
to keep its atmosphere sufficiently close to
provide a buffer from the solar wind.
J. Geophys. Res. Space Phys. http://doi.org/bgdv


Jammed microbes
feel the pressure


Microbes living in a confined
space can push up against
each other with enough force
to physically damage their
environment.

A team led by Oskar
Hallatschek of the University
of California, Berkeley, created
a microscopic chamber that
would hold roughly 100 cells of
budding yeast (Saccharomyces
cerevisiae). As the cells
proliferated, they did not leave
the chamber in a steady stream
through a narrow exit channel
but instead jammed together,
building up contact pressures
of almost 1 megapascal. This
force was enough to cause
cracks in agar gels containing
growing yeast cells, and to slow
down the organism’s growth.

Self-driven jamming may
help microbes to invade
soft materials, which could
contribute to biofouling (the
accumulation of unwanted
microbial material on surfaces),
the authors say.
Nature Phys. http://dx.doi.org/10.1038/nphys3741 (2016)


Immune cell aids 
vascular repair 


White blood cells that gobble 
up cellular debris also help to 
heal damaged blood vessels in 
the brain. 



Deqin Yang at Chongqing
Medical University and Lingfei
Luo at Southwest University,
Beibei in Chongqing, China,
and their co-workers used
lasers to rupture blood vessels
in the brains of zebrafish. 
They monitored the events 
that followed with time- 
lapse microscopy, and saw 
the immune cells, called 
macrophages, migrate to the 
damaged area. A macrophage 
extended projections that 
adhered to the broken 
ends of the blood vessels 
and pulled them together 
by mechanical traction. In 
zebrafish engineered to lack 
macrophages, cerebral blood 
vessels healed more slowly 
than in normal animals. 

The findings expand on the 
known roles of macrophages 
in the brain, the authors say. 
Immunity http://doi.org/bgrv 
(2016) 


Paper strips on 
the move 


Strips of paper embedded 
with a conducting polymer 
can perform a range of 
movements with electrical 
stimulation. 

George Whitesides at 
Harvard University in 
Cambridge, Massachusetts, 
and his colleagues made
paper actuators that expand
and contract according to 
their water content. They 
added a conducting polymer 
that coated the fibres of the 
paper, and then applied 
Scotch tape to one side. 
When electrically activated, 
the paper heats up, dries out 
and contracts. When the 
electrical current is turned 
off, the paper absorbs water 
from the air and expands. The 
tape is not affected by heat or 
moisture, so directs the paper 
to bend in certain ways. 

The authors made actuators 
of different shapes, including 
one that could curl up 
(pictured), and say that the 
devices could be used in 
lightweight micromachines. 
Adv. Funct. Mater. 26, 2446-2453 
(2016) 


How an antibody 
combats HIV 


A broadly neutralizing 
antibody against HIV can both 
boost people’s immunity to 
the virus and directly target 
infected cells. 

The antibody, 3BNC117, 
has previously been shown 
to lower HIV levels in the 
blood of patients. To study its 
effect on the immune system, 
Michel Nussenzweig of the 
Rockefeller University in New 
York City and his colleagues 
gave people with HIV one 
dose of the antibody. They 
found that patients with 
higher levels of the virus in 
their blood developed much 
broader neutralizing-antibody 
responses to HIV over six 
months than did those who 
had little to no virus (either 
uninfected individuals or 
people taking antiretroviral 
therapies). This indicated that 
the antibody is boosting the 
patients’ ability to produce
other HIV-neutralizing 
antibodies. 

In a second study, a team
led by Nussenzweig and
Arup Chakraborty of the
Massachusetts Institute of
Technology in Cambridge
showed that the same antibody
speeds up the removal of
HIV-infected T cells from the
blood of mice.
Science http://doi.org/bgdx;
http://doi.org/bgdz (2016)


SOCIAL SELECTION


Popular topics
on social media


Call calf Higgs Bison, says Twitter


Just weeks after the public voted overwhelmingly to name
a new UK research vessel Boaty McBoatface, the US
particle-physics facility Fermilab in Batavia, Illinois, asked
people on Twitter to name a bison (pictured) that was born
on its grassy grounds on 26 April. “What would you call
our new baby bison? Tweet us with #BisonNaming. Please,
no Bison McBisonface.” The science world stepped up with
a flood of responses: about 260 in the
first day. Sandia National Laboratories
in Albuquerque, New Mexico, offered
“Neil deGrass Bison”. Other ideas
included Higgs Bison, Enrico Furry
(a play on Enrico Fermi,
who discovered many
radioactive isotopes),
Bison Tennial and
Niels Bohrson.


CRISPR maps
yeast genes


The CRISPR-Cas9 gene-editing
system could be
harnessed to speed up the
search for DNA sequences
linked to specific traits.

Researchers can identify
genomic regions that are
linked to traits, but pinpointing
the responsible snippet of
DNA within that region is
difficult. To speed up the hunt,
Meru Sadhu and his colleagues
at the University of California,
Los Angeles, targeted the Cas9
enzyme to cut DNA at 95 sites
on one copy of chromosome
7 in yeast (Saccharomyces
cerevisiae). The team then built
a library of yeast strains, each
with a genetic rearrangement
at one of the 95 sites, making
it easier to determine the
function of a given section
of DNA.

The researchers used their
library to pinpoint a gene
variant that makes yeast
sensitive to manganese.
Science http://doi.org/bgd2
(2016)


Planet 9 may glow 
from within 


The hypothetical ninth planet 
of the Solar System could 
shine brightly. 

Planet 9, if it exists, is 
thought to be an ice planet 
that is slightly smaller than 
Neptune, orbiting in the far 
outer Solar System. Esther 
Linder and Christoph 
Mordasini of the University 
of Bern in Switzerland 
modelled the evolution of 
the planet’s probable internal 
structure. On the basis of its 
estimated mass and location, 
they conclude that the planet 
is still giving off residual heat 
that was generated when 
it was formed. This would 
cause the planet to emit light 
in the mid- and far-infrared 
range. 

The authors say that future 
telescopes, such as the Large 
Synoptic Survey Telescope, or 
dedicated surveys should be 
able to detect Planet 9 — or 
rule out its existence. 

Astron. Astrophys. 589, A134 
(2016) 


SEVEN DAYS The news in brief


EVENTS 


Mercury transit 
NASA’s Solar Dynamics
Observatory captured
Mercury’s 7.5-hour trek across
the face of the Sun on 9 May.
Such planetary transits are
relatively rare: Mercury and
Venus pass between Earth
and the Sun only occasionally.
Because Mercury’s orbit
is tilted by 7° relative to
Earth’s, the innermost planet
of the Solar System passes
between our planet and the
Sun only about 13 times a
century; the next time will
be in 2019. Mercury’s transit
of the Sun can give scientists 
spectroscopic clues about the 
chemistry of the planet's thin 
atmosphere. 


E-cig clampdown 


A US clampdown on electronic 
cigarettes (e-cigarettes) is in 
the offing after the US Food 
and Drug Administration 
finalized its stance on the 
controversial products. The 
agency, which currently
regulates cigarettes and rolling
tobacco, announced on
5 May that it would extend its
authority to cover e-cigarettes,
hookah tobacco and other
products containing nicotine
from 8 August. This will
prevent their sale to under-18s.
Researchers are divided
over the benefits and harms of
e-cigarettes, but Sylvia Burwell,
the US Secretary of Health and
Human Services, said that the
“drastic leap” in the use of such
products was “creating a new
generation of Americans who
are at risk of addiction”. See
page 146 for more.


TTIP opposition
The controversial trade deal
being negotiated between
the European Union and the
United States is in jeopardy
after French President
François Hollande said
on 3 May that he will not
accept poorly regulated
free trade that questions
some of France’s “essential
principles”. His comments
followed the leak on 2 May
of negotiation papers
concerning the Transatlantic
Trade and Investment
Partnership (TTIP). Among
other things, critics say that
TTIP could force the EU to
abandon its precautionary
principle in dealing with
health, environmental and
food-safety issues. Currently,
some manufacturers and
importers have to prove that
their product is harmless if
scientific data do not permit
a complete evaluation.
Although France has no veto
over the deal, it could block
the negotiation process.


Blazing inferno in Alberta


Almost 90,000 people were evacuated from the
city of Fort McMurray last week after wildfires
raged in the Canadian province of Alberta.
By 8 May, the huge blaze had destroyed some
160,000 hectares of land and forest. Thanks to
wet weather, fire-fighting conditions became
more favourable over the weekend, but several
fires in the region are still out of control.
Although the flames have not reached any of
Alberta’s oil-processing facilities, these have
been closed, and efforts to stop the fire from
spreading to neighbouring Saskatchewan may
continue for weeks, authorities said. The cause
of the disaster is still under investigation.


Australian budget


There were no big winners or
losers for science in Australia’s
2016-17 federal budget,
released on 3 May. But the
government announced
long-term investment plans
for the Australian Antarctic
Territory. The budget,
which is released ahead of
national elections in July,
includes Aus$496.2 million
(US$364 million) by 2050 to
protect Australia’s strategic
interests in Antarctica and
to sustain its scientific,
environmental and economic
operations there. The
government will also provide
Aus$55 million over the next
decade for enhanced research
and transport infrastructure in
the territory, including support
for over-ice convoys to research
stations or drilling sites and
year-round aviation access.


PEOPLE


UN climate chief


A seasoned Mexican diplomat
and former foreign minister
is to become the world’s
top climate official. Patricia
Espinosa Cantellano, currently
Mexican ambassador to
Germany, was nominated on
3 May as executive secretary of
the United Nations Framework
Convention on Climate
Change by UN secretary-general
Ban Ki-moon.
Espinosa, a human-rights
specialist with 30 years of
high-level experience in
international relations, is to
succeed Costa Rica’s Christiana
Figueres, who will leave in July
after six years in office.


Rockefeller head 


Geneticist Richard Lifton 
will be the 11th president of 
the Rockefeller University in 
New York City, the university 
announced on 5 May. Lifton, 
62, who received the 2014 
Breakthrough Prize in Life 
Sciences for his work on the 
genetic causes of high blood 
pressure, will take office in 
September. Currently the 
Sterling Professor of Genetics 
and chair of genetics at Yale 
University in New Haven, 
Connecticut, Lifton will 
succeed Marc Tessier-Lavigne, 
who is leaving Rockefeller to 
become president of Stanford 
University in California. 


RESEARCH


Giant stick insect


Scientists in China say that
they have found the world’s
longest insect: a new species
of stick insect that is well over
half a metre long. Zhao Li of the
Insect Museum of West China
in Chengdu told the Xinhua
news agency that he found the
creature in 2014, on a mountain
near Liuzhou City in southern
China. At 62.4 centimetres
long, the insect, which has
been dubbed Phryganistria
chinensis Zhao (pictured)
although it has not yet been
formally described, is almost
6 cm longer than the previous
record holder, Phobaeticus
chani, a stick insect from
Malaysia discovered in 2008.


TREND WATCH


Social habits in a country dictate
how long its residents sleep at
night. Data were collected from
5,459 people in 20 countries
who used a smartphone app
called ENTRAIN to record
their daily bed times, wake
times and exposure to outdoor
and indoor light over 1 year.
The analysis showed that social
pressures to ignore the biological
signals of sleepiness send
people, particularly men, to
bed late, but do not affect the time
they wake up (O. J. Walch et al. Sci.
Adv. 2, e1501705; 2016).
SOURCE: O. J. WALCH, A. COCHRAN & D. B. FORGER SCI. ADV. 2, e1501705 (2016)


Millions for LIGO 


More than 1,000 scientists
and engineers will share
a US$3-million Special
Breakthrough Prize in
Fundamental Physics for the
discovery of gravitational waves
from colliding black holes.
Ron Drever, Kip Thorne and
Rainer Weiss, who co-founded
the Laser Interferometer
Gravitational-Wave
Observatory (LIGO), two
US detectors where signals of
elusive gravitational waves were
first discovered last September,
receive equal parts of
$1 million, a prize committee
announced on 2 May. The
other 1,012 contributors to the
experiment worldwide will
share the remaining $2 million
equally. On 4 May, the LIGO
team was also awarded the
$500,000 Gruber Foundation
Cosmology Prize.


GET AN EARLY NIGHT


FACILITIES 


Boaty no more 

The United Kingdom’s new
polar research ship will
be named after naturalist
and broadcaster David
Attenborough, the UK
government announced
on 6 May. A public vote to
suggest a name for the planned 
£200-million (US$289-million) 
vessel was overwhelmingly in 
favour of Boaty McBoatface. 
Instead, the ship will be 
christened RRS Sir David 
Attenborough — but Boaty 
McBoatface will live on as the 
name of the ship’s remotely 
operated submersible. 


Chimp haven

The world’s largest chimpanzee
research facility is retiring
its chimps. The New Iberia
Research Center (NIRC)
in Lafayette, Louisiana,
announced on 3 May that in the
summer it will begin to relocate
all of its 220 primates, up
to 10 at a time, to Project
Chimps, a sanctuary for
chimpanzees in Blue Ridge,
Georgia. NIRC was among the
last labs in the world to allow
invasive biomedical research on
chimpanzees before the United
States effectively banned the
practice in 2015 and declared
captive chimpanzees to be an
endangered species. NIRC says
that its retirement plans had
been in place since 2014.


A smartphone app reveals how sleep habits differ between countries.


17-18 MAY

Details of a €1-billion
initiative in quantum
technology are
announced at the
Quantum Europe
Conference in
Amsterdam. See
go.nature.com/7xdf34
for more.
go.nature.com/hne6pn


17-22 MAY

The 10th World
Biomaterials Congress
takes place in Montreal,
Canada.
www.wbe2016.org


Access control 


Thousands of journals are 
being removed from the 
Directory of Open Access 
Journals (DOAJ) in response to 
concerns about the increasing 
number of ‘predatory 
publishers’ with dubious 
peer-review and publishing 
practices. In a bid to tighten
standards for inclusion, the 
DOAJ had asked more than 
11,000 open-access journals 
listed on the directory to 
provide details about their 
operations. About 3,300 
journals did not submit the 
requested information in 
time, so will now be delisted, 
says Lars Bjørnshauge, the
directory’s managing director. 
Over the past 2 years, the 
DOAJ has rejected more than 
5,400 open-access journals, 
often owing to questionable 
publishing ethics or lack of 
editorial transparency. See 
go.nature.com/t7aioi for more. 
TECHNOLOGY


The Oculus Rift virtual-reality headset costs about US$600. 


Low-cost headsets boost 
virtual reality’s lab appeal 


A wave of user-friendly devices is making the technology an attractive research tool. 


BY DAVIDE CASTELVECCHI 


Devices that have slashed the cost of
virtual reality, and transformed its
performance, have implications for
scientists as well as gamers. Researchers who 
are experimenting with the head-mounted dis- 
plays say that they have the potential to find 
widespread use as a research tool. 
Virtual reality (VR), which lets users experi- 
ence a computer-generated, three-dimensional 
world, has produced recurring waves of hype
since the 1980s, but this time could be different,
says Mel Slater, a computer scientist at
the University of Barcelona in Spain who has 
worked in the field for two decades. Thanks to 
technologies originally developed for smart- 
phones and video-gaming graphics, the per- 
formance of these headsets is now comparable 
to that of high-end devices that cost tens of 
thousands of dollars. They are sophisticated, 
affordable and user-friendly enough to become 
a staple of research labs, says Slater, rather than 
tools available to only very few researchers. 
A gadget that has transfixed technology-news
outlets is the Oculus Rift, made by Facebook- 
owned start-up Oculus VR of Menlo Park, 
California. It costs US$600 — but operating 
it also requires a high-end computer that can 
cost more than $1,000. Similarly priced gadgets 
made by smartphone-maker HTC and Sony are 
expected to become available this year. Vastly 
cheaper sets made by Google and Samsung turn 
a smartphone into a more basic VR device. 

A lab can now buy a VR device without a 
dedicated equipment grant, says Anthony 
Steed, a computer scientist who heads a
virtual-environments group at University 
College London. 

He and Slater have been experimenting for 
more than a year with early prototypes of the 
HTC and Oculus devices, and say that the per- 
formance is just as good as that of higher-end 
devices, and getting better. The new devices are 
light enough to be worn for extended periods, 
and they react quickly to the user’s movement, 
preventing the motion sickness that can occur 
when using VR. “Two to three years ago, the 
lab we used for our research cost €100,000 
[US$114,000] to set up. Now we can do the 
same for about €4,000,” says Slater. 

For years, Slater has run VR experiments 
with psychologists, including one that tested 
how white people’s biases change after they 
have virtually inhabited the body of a black 
person. 

Last week, Slater and Daniel Freeman, a clin- 
ical psychologist at the University of Oxford, 
UK, and their collaborators published a study 
that suggests that VR could help to treat people 
with severe paranoia, who often avoid crowded 
places because of a perception that other peo- 
ple want to hurt them (D. Freeman et al. Br. 
J. Psychiatr. http://doi.org/bgrr; 2016). The 
experimental therapy attempts to teach people
to lower their defences and to trust others by
letting them visit virtual environments such as 
crowded lifts or underground trains. 

Other studies have used VR to try to treat 
post-traumatic stress disorder and fear of 
heights or spiders. These experiments used 
expensive, high-end gear, but several of the 
researchers involved say that they now plan to 
start using consumer headsets instead. 

As well as being cheap, the headsets are 
simple to set up. “It’s a proper out-of-the-box 
experience,” says Steed. If larger studies prove
the therapies to be effective, patients could 
borrow the equipment and use it at home, 
Freeman says. 

Neuroscientist Elizabeth Buffalo at the Uni- 
versity of Washington in Seattle is also con- 
sidering how to use the Oculus Rift. Her team 
studies monkeys as the animals explore inter- 
active environments that are represented on 
a screen. Head-mounted sets that create a 3D 
environment would create a more immersive, 
and therefore natural, experience, she says, but 
current products are too big to fit on a mon- 
key’s head. “We are working on hacking the 
Oculus to achieve this,” Buffalo says. 

Creating complex virtual environments 
still requires specialized computer skills, says 
Slater. But costs are falling now that some
software developed to aid video-game companies
is free to use, and many labs outsource the
work. A related technology called augmented 
reality (AR), which superimposes images onto 
the user’s field of view rather than replacing 
the scene with a different one, could also be 
of use in the lab, helping researchers to visual- 
ize and share data sets, says Mark Billinghurst, 
who studies human-computer interaction at 
the University of South Australia in Adelaide. 

Google Glass, an early attempt at AR that 
projected images into the corner of a pair of 
glasses, was a commercial flop, but Microsoft is
about to launch a more sophisticated AR head- 
set called HoloLens. “With AR technology 
like HoloLens,” says Billinghurst, “research- 
ers could easily see a complex virtual data set 
superimposed on a real table in front of them, 
and also see each other face to face across the 
table and talk about the data.” 

Mary Whitton, a computer scientist who 
works on virtual environments at the Univer- 
sity of North Carolina at Chapel Hill, says that 
there is still room for improvement in the way 
the systems track users’ motions and in how 
users can interact with the virtual world using 
their hands. Still, she says: “I’ve had most fun
seeing how people use what we’ve built in ways
we never imagined.”


Gene variants linked to 
education prove divisive 


Study uncovers 74 genetic markers that influence the number of years spent in education. 


BY ERIKA CHECK HAYDEN 


The largest-ever genetics study in the
social sciences has turned up dozens
of DNA markers that are linked to the
number of years of formal education an indi- 
vidual completes. The work, reported this 
week in Nature, analysed genetic material from 
around 300,000 people. 

“This is good news,” says Stephen Hsu, a
theoretical physicist at Michigan State Univer- 
sity in East Lansing, who studies the genetics of 
intelligence. “It shows that if you have enough 
statistical power you can find genetic variants 
that are associated with cognitive ability.” 

Yet the study’s authors estimate that the 
74 genetic markers they uncovered comprise 
just 0.43% of the total genetic contribution 
to educational achievement (A. Okbay et al. 
Nature http://dx.doi.org/10.1038/nature17671; 
2016). By themselves, the markers cannot
predict a person’s performance at school.
And because the work examined only people
of European ancestry, it is unclear whether
the results apply to those with roots in other
regions, such as Africa or Asia.

Genetic differences explain just 3.2% of the variation
in educational achievement between people.

154 | NATURE | VOL 533 | 12 MAY 2016

The findings have proved divisive. Some 
researchers hope that the work will aid 
studies of biology, medicine and social policy, 
but others say that the emphasis on genet- 
ics obscures factors that have a much larger 
impact on individual attainment, such as 
health, parenting and quality of schooling. 

“Policymakers and funders should pull the 
plug on this sort of work,” said anthropologist
Anne Buchanan and genetic anthropologist
Kenneth Weiss at Pennsylvania State University
in University Park in a statement to Nature.
“We gain little that is useful in our understanding
of this sort of trait by a massively large
genetic approach in normal individuals.”

The study is the latest to apply genetic 
analysis to social science. Some of its
authors have also studied the genetics of 
happiness, and plan to examine the genet- 
ics of fertility and of risk-taking behaviour. 

“There's been a long-standing assump- 
tion that [genetic] differences among peo- 
ple are not really relevant for social-science 
studies,” says study co-author Christopher 
Chabris, a cognitive psychologist at Union 
College in Schenectady, New York. “The 
main effect of this work may be the increas- 
ing realization that genetic differences 
matter, and now people can start to figure 
out how and why.”

Robert Plomin, a behavioural geneticist 
at King’s College London, agrees. The 
study’s authors identified 9 million genetic 
variants that, as a group, have some influ- 
ence on school success; these include the 
74 genetic markers that show strong indi- 
vidual influence. Considered as part of an 
overall ‘polygenic’ score, the variants explain 
3.2% of the differences in educational 
attainment between individuals. Plomin 
says that such studies could pave the way 
to predictive genetics for traits such as how 
well children perform on standardized tests. 

Still, the researchers estimate that a person 
who carries two copies of the genetic variant 
that has the strongest known effect would 
complete nine more weeks of schooling over 
a lifetime than a person with no copies. 

The authors also report that the markers 
they found overlap with those associated 
with better performance in cognitive tests, 
bolstering the idea that educational attain- 
ment is a proxy for intelligence. Because 
few large studies have tested individuals’ 
cognitive performance, it has been dif- 
ficult to discern genetic factors linked to 
intelligence. But it is much easier to amass 
large amounts of data that have sufficient 
statistical power to uncover genetic effects 
related to educational attainment, because 
medical studies routinely record data on 
participants’ years of schooling. 

Hsu predicts that growing knowledge of 
genetic contributions to intelligence could 
be used to help parents to select embryos 
created through in vitro fertilization. “You 
could allow the parents to decide whether 
they want to implant or not implant an 
embryo that has a serious cognitive impair- 
ment,” Hsu says. “What is missing is the 
ability to know what places in the genome 
are affecting cognitive ability, but studies 
like this one will get us to that point.” 

But even if all the genetic contributors to 
educational attainment were known, the 
study’s authors say, their effect would prob- 
ably be overshadowed by other factors such 
as the socio-economic and educational 
status of a child’s family. Says Chabris, “It 
would be irresponsible to look at a poly- 
genic score and use it to make a prediction 
for a single individual.”
ARCHAEOLOGY


Sacred tattoos found 
on Egyptian mummy 


Unusual designs include eyes, flowers and animals. 


BY TRACI WATSON 


A mummy from ancient Egypt was
tattooed with sacred symbols,
which may have served to
advertise and enhance the religious powers 
of the woman who received them more than 
3,000 years ago. 

The tattoos are the first found on a mummy 
from dynastic Egypt to show actual objects, 
among them lotus blossoms on the mummy’s 
hips, cows on her arm and baboons on her 
neck. Just a few other ancient Egyptian mum- 
mies sport tattoos, and those are merely pat- 
terns of dots or dashes. 

Especially prominent among the mummy’s 
tattoos are ‘wadjet eyes’: possible symbols of 
protection against evil that adorn the mum- 
my’s neck, shoulders and back. “Any angle that 
you look at this woman, you see a pair of divine 
eyes looking back at you,” says bioarchaeologist
Anne Austin of Stanford University in Cali- 
fornia, who presented the findings last month 
at a meeting of the American Association of 
Physical Anthropologists in Atlanta, Georgia. 

Austin noticed the tattoos while examining 
mummies for the French Institute of Oriental 
Archaeology in Cairo, which conducts research 
at Deir el-Medina, a village once home to the 
ancient artisans who worked on tombs in the 
nearby Valley of the Kings. Looking at a headless,
armless torso dating from 1300 BC to
1070 BC, Austin noticed markings on the neck.
She soon realized that they were tattoos. 
Austin knew of tattoos discovered on other
mummies using infrared imaging (M. Sama- 
delli et al. J. Cult. Herit. 16, 753-758; 2015), 
which peers more deeply into the skin than 
visible-light imaging. With help from infra- 
red lighting and an infrared sensor, she deter- 
mined that the Deir el-Medina mummy boasts 
more than 30 tattoos, including some on skin 
so darkened by the resins used in mummifica- 
tion that they were invisible to the eye. Austin 
and Cédric Gobeil, director of the French pro- 
ject at Deir el-Medina, digitally stretched the 
images to counter distortion from the mum- 
my’s shrunken skin. 

The tattoos identified so far carry powerful 
religious significance. Many, such as the cows, 
are associated with the goddess Hathor, one of 
the most prominent deities in ancient Egypt. 
The symbols on the throat and arms may have 
been intended to give the woman a jolt of mag- 
ical power as she sang or played music during 
rituals for Hathor. 

The tattoos may also be a public expres- 
sion of the woman's piety, says Emily Teeter, 
an Egyptologist at the University of Chicago's 
Oriental Institute in Illinois. “We didn’t know
about this sort of expression before,” Teeter 
says, adding that she and other Egyptologists 
were “dumbfounded” when they heard of the 
finding. 

Austin has already discovered three more 
tattooed mummies at Deir el-Medina, and 
hopes that modern techniques will uncover 
more elsewhere.
NUCLEAR SECURITY


Biologists struggle with 
push to end use of caesium 


Proposed switch to X-ray irradiators could affect results of research. 


BY JEFF TOLLEFSON 


Anybody who wants to conduct
research on mice in Margaret
Goodell’s immunology lab must submit
to a host of security measures, starting
with a background check by the FBI. That's 
because Goodell, a researcher at Baylor 
College of Medicine in Houston, Texas, uses 
a caesium-based irradiator to destroy bone 
marrow in mice that are set to receive stem- 
cell transplants. The US government fears 
that the radioactive caesium could be stolen 
to make a ‘dirty’ bomb. 

Now the US National Nuclear Security 
Administration (NNSA) is working with sci- 
entists to investigate how — or whether — to 
replace caesium irradiators with less dangerous 
X-ray technology. Researchers have used the 
caesium devices for decades, to study every- 
thing from immunotherapy to cancer treat- 
ment, and some fear that switching to X-ray 
irradiators will affect their results. 

Goodell, who has found subtle differences 
in how the mouse immune system responds 
to the two types of device, prefers Baylor’s 
caesium irradiator. Her research has revealed 
that immune cells called B lymphocytes 
recovered more slowly in mice treated with 
an X-ray irradiator than in those exposed 
to caesium. But other immune cells, known 
as myeloid cells, rebounded faster after the 
X-ray treatment (B. W. Gibson et al. Comp. 
Med. 65, 165-172; 2015). Because of this, she 
says, “it would be difficult to compare studies 
using X-rays to the research that was done ten 
years ago”.

For nuclear regulators, the risk posed
by caesium is clear. The element’s highly
radioactive isotope caesium-137 comes in a
powdered form that can be dispersed in air
or water; exposure to the substance can cause
burns, radiation sickness or death, depending
on the dose. Caesium irradiators, which
have long been used to eliminate pathogens in
supplies of blood as well as for research applications,
rely on small capsules of radioactive
caesium chloride encased in a lead-covered
box. There are more than 800 such devices in
US medical and research facilities.


Biomedical researchers often use caesium-137 to irradiate cells.

Several countries — including France, 
Norway and Japan — are shifting away from 
using caesium irradiators in blood banks 
because of security fears, and last year the 
NNSA began working with hospitals in the 
United States to do the same. But finding
alternative ways to treat blood is relatively
simple. The NNSA is working with research- 
ers to pin down the more complicated issue 
of how X-ray irradiators might differ from 
conventional caesium instruments for other 
applications. 

“You talk to the doctors, and they are afraid 
that we are going to be taking away their 
devices,” says Maegon Barlow, director of radiological
security at the NNSA. “But it’s really
trying to facilitate, not force.” 

The agency is negotiating with the Mount 
Sinai Health System in New York City to sup- 
port a new round of studies that will compare 
X-ray and caesium irradiators. Jacob Kamen, 
Mount Sinai’s chief radiation-safety officer,
notes that some researchers there have 
already conducted similar experiments. 

Peter Heeger, head of organ-transplant 
research at Mount Sinai’s Icahn School 
of Medicine, and his colleagues use 
caesium irradiators when testing immune 
responses in people who are going to 
receive organs. To predict whether a 
recipient's body will reject a new organ, the 
researchers culture B lymphocytes from 
the organ donor and test them against 
immune cells from the recipient. But 
B lymphocytes will not divide unless they 
are activated — here, by the presence of 
connective-tissue cells called fibroblasts. 
Heeger’s team irradiates the fibroblasts to 
prevent them from replicating during this 
process. The scientists have run a series 
of unpublished experiments to determine 
how much X-ray radiation is necessary to 
suppress fibroblast growth. 

“Now we know, and we are now 
comfortable switching for this particular 
procedure,” says Heeger.

But Goodell says that many researchers 
would have to conduct lengthy experi- 
ments to ensure that they can make the 
transition without losing confidence in 
their results. Nor is she convinced that 
a switch to X-rays is necessary, given 
the security safeguards that are already 
in place. Anybody who needs to use the 
caesium irradiator at Baylor must present 
a security badge, enter a personal iden- 
tification number and then submit to an 
iris scan. And if a person inside the secure
room that contains the irradiator breaches 
any security protocols, an alarm automati- 
cally goes off in the university’s security 
office. 

“As a biologist, it’s not clear to me what 
case has been made for [caesium irra- 
diators] being an enormous security risk,” 
she says. 

Advocates of ending use of the devices 
say that the goal is to eliminate the risk 
of nuclear material falling into the wrong 
hands wherever possible. The security 
measures in place to protect caesium irra- 
diators would not necessarily prevent the 
theft of nuclear material by somebody with 
permission to access these instruments, 
says Charles Ferguson, president of the Fed- 
eration of American Scientists in Washing- 
ton DC. Efforts to secure nuclear materials 
are often focused on this ‘insider threat’, as 
well as the disposal and recycling of irradia- 
tors, which can contain enough caesium to 
pose a hazard for centuries. 

“I would not want humanity to lose the
benefits of science,” says Ferguson. “But if 
we can develop alternative technologies 
that prove comparable and can reduce the 
security threat to zero, I think that’s a good 
thing.”
Q&A Thierry Mandon
France’s research 
transformation 


Thierry Mandon, who became France’s research and higher-education minister last June, has
vowed to cut bureaucracy in a research system that is undergoing major changes. In 2013,
laws were passed to accelerate the consolidation of universities, prestigious ‘grandes écoles’ and
research-agency labs into regional clusters that could develop common research policies.
And in April, Mandon announced measures to further reduce researchers’ paperwork and
administrative burden. He talks to Nature about what he hopes to achieve in the year remaining
before France’s presidential elections in 2017.


What are the most urgent items on
your to-do list?

To simplify the rules that govern higher 
education, and research. To have more PhD 
students and researchers recruited by compa- 
nies and by the public sector, and so instil a 
culture of research in the places where deci- 
sions are made. To help universities to develop 
their own sources of income, so that they can 
be more independent of the state. To promote a 
renaissance of the social sciences. And to spur 
the digitalization of higher education. 


What about funding? Scientists have warned 
that France’s basic research is endangered by 
a lack of funding, in particular by cuts at the 
National Research Agency (ANR). 

The ANR has seen its annual budget fall by 
around €250 million (US$285 million) since 
2012, to around €550 million. As a result, the 
success rate of grant applications is too low, 
at just under 10%. As President Hollande 
announced in March, we plan to increase the
ANR’s budget by 10% this year, and by 20% 
next year, to bring it back to €800 million by 
2018 — around the same level as its peak in 
2008. We aim to boost the success rate of grant 
applications to between 14% and 20% in 2017. 

Researchers have been critical of ANR 
bureaucracy. We plan to introduce a series of 
50 reforms, most taking effect later this year, to 
lessen the administrative burden throughout 
the research and higher-education systems. 
The process of preparing grant proposals for 
the ANR will be greatly simplified, as will 
procedures for their assessment. 


What else do you hope to put in motion
before next year’s elections?

We want to reinvigorate the social sciences. 
Research in the social sciences remains very 
focused on publications — but value will be 
created through greater contact with the 
hard sciences and through social-science
researchers informing business and
policy leaders. For example, after the ter- 
rorist attacks in France last year, the CNRS 
[France’s National Centre for Scientific 
Research] got all the relevant research 
groups working to better understand the 
problem of radicalization. We plan to 
launch new research programmes, with an 
initiative at the University of Paris-Sud’s 
campus d’Orsay which will bring together 
much of our expertise in the social sciences. 


France has often had a reputation of 
lagging in innovation — yet French start-up 
firms seem to be an emerging force. 
France has made it easier for researchers to 
become entrepreneurs. But we don’t succeed
in making start-ups grow. Part of the
problem is that the strategy of start-ups is 
too often to be bought out by firms in other 
countries. There’s still too little direct con- 
tact between companies and universities, 
and we are working to improve this. Wealth 
creation must become one of the missions 
of the universities. Moreover, the universi- 
ties are still 90% dependent on state fund- 
ing. More direct links with companies 
could also allow universities to generate 
more durable financing themselves. 


Is a problem in the perception of French 
science that the research community 
seems to promote its successes much less 
than do, say, its US and UK counterparts? 

It is a problem. I spend my time telling 
French researchers to sell themselves a bit 
better. Take the example of the recent dis- 
covery of gravitational waves. There was a 
simultaneous press conference in Italy, in 
France and in the United States. In France, 
it was a low-key event in a minuscule room 
at CNRS, where our researchers expressed 
everything very modestly. By contrast, at 
the US event [at the National Press Club in 
Washington DC], one had the impression 
that we were at a White House event. 

I’m not saying that French researchers 
should become as excessive as the Ameri- 
cans can sometimes be in their capacity 
to sell their advances. But in the modern 
world, we need to be a bit more promo- 
tional to make our excellence in research 
better known. At the same time, I respect a 
lot the sort of ethical aspects of their mod- 
esty, which has a good side. 

It often seems very difficult to create 
change in France. But universities are inno- 
vating. My big message is that France is in 
the process of profoundly changing, and 
that researchers often aren't really taking 
the measure of that change.


INTERVIEW BY DECLAN BUTLER
This interview has been translated from French
and edited for length and clarity. See
go.nature.com/o79dmq for a longer version.




Neuroscientist Elena Cattaneo has made complaints about the Human Technopole to the Senate. 


Row grows over 
biomedical centre 


Document submitted to the Italian Senate criticizes institute 
that will oversee a €1.5-billion project. 


BY ALISON ABBOTT 


A plan to create a €1.5-billion
(US$1.7-billion) centre for biomedical
and nutritional research has been
causing rifts between Italian scientists ever 
since Prime Minister Matteo Renzi announced 
it last November. Now the row has escalated, 
courtesy of a 48-page document submitted to 
the Italian Senate on 4 May by Senator Elena 
Cattaneo, who is also a neuroscientist at the 
University of Milan. 

In the document, she complains that the 
idea for the centre, called the Human Techno- 
pole, was conceived by a small group of scien- 
tists behind closed doors, and that the large 
sum of money involved should not be concen- 
trated in a single project, in particular because 
Italy’s research community as a whole has been 
starved of funds for years. 

“To allocate money in this way without dis- 
cussion of ideas corrupts the ethics of science,” 
Cattaneo told Nature. 

That sentiment is in line with arguments
already made by Cattaneo and others.
Cattaneo’s report also lists a series of 
complaints against the Italian Institute of 
Technology (IIT) in Genoa, which Renzi 
has designated to oversee the Technopole 
project. 

The complaints against both institutes are 
“entirely political”, says Roberto Cingolani,
who is the Technopole’s main architect and 
director of the IIT. He designed the Techno- 
pole concept together with scientists from 
various universities and research insti- 
tutes in Milan, and now plans to submit a 
detailed rebuttal of Cattaneo’s document to 
the parliament. 

Like the IIT, the Human Technopole was 
approved by government decree, and, although 
supported with public money, will be a private 
foundation. As such, it will avoid much of the 
red tape that holds back state universities and 
publicly funded research institutes. 

According to Cingolani’s plan, the Techno- 
pole will focus on genomics and personalized 
medicine, with an emphasis on nutrition,
cancer and neurodegenerative diseases. The
plan is now being evaluated by a panel of 
international scientists. 

But many researchers are incensed that the 
project was announced without an open call 
for ideas. “The evaluators should have had the 
opportunity to compare different proposals,” 
says astrophysicist Giovanni Bignami, former 
director of the Rome-based National Institute 
for Astrophysics. 

Earlier this year, physicist Giorgio Parisi 
at the Sapienza University of Rome initiated 
a petition, now signed by more than 
72,000 people, arguing for Italy to invest more 
in research. But even he takes issue with the 
way in which the cash is to be doled out. “An 
investment of this magnitude should have 
involved the whole scientific community, and 
different projects should have been compared,” 
he says. 

Supporters of the Technopole say that 
what matters is the progress of Italian 
science, not the specifics of how the project 
was chosen, and that the government is within 
its rights to set up such a centre by decree. It 
is “nothing unusual for a government to set 
science policy”, says neuroscientist Emilio
Bizzi at the McGovern Institute for Brain 
Research at MIT in Cambridge, Massachu- 
setts, and a member of the IIT scientific 
advisory board. 


Cattaneo’s report also questions the choice 
of the IIT to coordinate the Technopole 
project. She notes that although the IIT is rated 
top among the country’s institutes for nano- 
technology when measured by the impact of its 
publications, it is not in the top five for the life 
sciences or biomedicine, which are the subjects
that will be the focus of the Technopole.
She cites a newspaper article from
6 January that reported that the
IIT had not spent all of the money it
received in 2013, and
raised the issue of why the executive had not 
turned down the payments if it was not going 
to use them, so that they could be used by other 
research institutes. And Cattaneo’s report says 
that, according to the IIT’s internal regulations, 
the institute appoints members of a national 
committee to evaluate the institute's progress, 
without the oversight of an external body.

Cingolani refutes all of these criticisms.
He says that there are many ways to measure 
scientific success, and accuses Cattaneo of 
cherry-picking the facts to fit her argument. 
He points out that any money that the IIT 
doesn't spend gets returned to the state. And 
he says that the IIT undergoes many levels of
evaluations and that all are carried out according
to best practice. “I am preparing my rebuttal 
line by line, point by point,” he told Nature. 

Parliament has yet to decide on whether 
to debate the issues raised by Cattaneo’s sub- 
mission. But the ongoing public discussion is 
fuelling calls for Italy to reform how it funds 
research. 

It is one of the few countries in the Euro- 
pean Union without a national research 
agency, and in a Correspondence in this week's 
Nature, 15 Italian members of Europe's life- 
sciences organization EMBO emphasize the 
need for such an agency, to provide “transpar- 
ent jurisdiction over the funding and execution 
of research” (see page 179). “The agency,” the
scientists add, “would also monitor the pro-
gress of the Human Technopole and oversee
its accountability.”


CORRECTIONS 

The News story ‘Human embryos grown in 
the lab for longest time ever’ (Nature 533, 
15-16; 2016) wrongly characterized the 
US 14-day restriction on in vitro growth of 
human embryos as a law — it is a guideline. 
And the News Feature ‘The material
code’ (Nature 533, 22-25; 2016) omitted
Gerbrand Ceder’s first name.

STUCK IN
THE MIDDLE


Eric Vilain built a career studying aspects of 
sex that make some people uncomfortable. 
Now things are getting uncomfortable for him. 


BY SARA REARDON 


As a medical student in Paris in the 1980s,
Eric Vilain found himself pondering
the differences between men and
women. What causes them to develop
differently, and what happens when the
process goes awry? At the time, he was
encountering babies that defied simple classifi- 
cation as a boy or girl. Born with disorders of sex 
development (DSDs), many had intermediate 
genitalia — an overlarge clitoris, an undersized 
penis or features of both sexes. 

Then, as now, the usual practice was to 
operate. And the decision of whether a child 
would be left with male or female genitalia 
was often made not on scientific evidence, says 
Vilain, but on practicality: an oft-repeated, if 
insensitive, line has it that “it’s easier to dig a hole 
than build a pole”. Vilain found the approach 
disturbing. “I was fascinated and shocked by 
how the medical team was making decisions.” 

Vilain has spent the better part of his career 
studying the ambiguities of sex. Now a paedia- 
trician and geneticist at the University of Cali- 
fornia, Los Angeles (UCLA), he is one of the 
world’s foremost experts on the genetic deter- 
minants of DSDs. He has worked closely with 
intersex advocacy groups that campaign for 
recognition and better medical treatment — a 
movement that has recently gained momentum.
And in 2011, he established a major longitudinal
study to track the psychological and medical 
well-being of hundreds of children with DSDs. 

Vilain says that he doesn’t seek out con- 
troversy, but his research seems to attract it. 
His studies on the genetics of sexual orienta- 
tion — an area that few others will touch — have 
attracted criticism from scientists, gay-rights 
activists and conservative groups alike. He 
is also a medical adviser for the International 
Olympic Committee, which about five years 
ago set controversial rules by which intersex 
individuals are allowed to compete in women’s 
categories. 

But what has brought Vilain the most grief 
of late has been his stance on sex-assignment 
surgery for infants with DSDs. Although he 
generally opposes it, he won't categorically 
condemn it or the doctors who perform it. As 
a result, many intersex advocates who object 
to the practice now see him as a hindrance to 
their cause. In November, nine bioethicists and 
activists resigned as advisers to his longitudinal 
study in protest. “I just lost my patience,” says
Alice Dreger, a bioethicist who used to work at 
Northwestern University in Evanston, Illinois, 
and who was among the first to leave the study. 

Although dismayed by their departure, Vilain 
refuses to take a stance until it is supported by
science. “The thing I don’t want to compromise
is scientific integrity, even when it clashes with
the community narrative.”

© 2016 Macmillan Publishers Limited. All rights reserved.


BREAKING BINARY 

The idea that there are only two sexes is so 
entrenched in society that the first ques- 
tion many people ask on finding out that a 
friend is pregnant is: boy or girl? “People don’t
answer ‘I’m having a baby’,” says Vilain. “They
probably should.”

At Necker University Hospital for Sick 
Children in Paris in the 1980s, he says, doctors 
presumed that a child would be psychologically 
damaged if he or she did not have normal-look-
ing genitalia. In Vilain’s experience, that belief
was so strong that doctors would take genital 
abnormalities into account when deciding how
hard to fight to save a premature baby. “The
unanimous feeling was that boys with a micro- 
penis could never achieve a normal life — that 
they were doomed,” he says. (The paediatric-
surgery department at Necker refused to
answer questions relating to past or current 
standards of care.) 

DSDs occur in an estimated 1-2% of live 
births, and hundreds of genital surgeries are 
performed on infants around the world every 
year. But there are no estimates as to how often
a child’s surgically assigned sex ends up differ-
ent from the gender they come to identify with. 

What do exist, however, are stories of people 
who say that they have been harmed: children 
who struggle to fit in with peers, adolescents 
who are stressed, harassed or attempt suicide, 
and adults who are furious that they were not 


involved in the decision to modify their bodies. 
Over the past two decades, and especially in 
the past few years, intersex activists world- 
wide — some of whom do not identify as either 
gender — have begun to speak out against the 
practice. Unless a child’s life is in danger, they 
argue, he or she should have the right to decide 
on surgery when older. 

Vilain’s fascination with the biological 
complexities of sexual differentiation made him 
want to study the causes of DSDs. So in 1990, he 
joined the lab of geneticist Marc Fellous at the 
Pasteur Institute in Paris. Fellous was studying a 
newly discovered gene called SRY, which resides 
on the Y chromosome and is crucial in trigger- 
ing the development of male features. Vilain 
helped to identify the causes of several DSDs, 
such as XY people who look female because
of mutations that disable the SRY gene, and
people who carry a copy of SRY even if they do
not have a Y chromosome. Vilain was an unu-
sual student, Fellous says, because his clinical
background allowed him to bridge lab work and 
patient care. Fellous says that it is often difficult 
to explain to the families of children with DSDs 
why the research would be helpful. “Eric was 
useful for this,” he says. “He was a very open 
mind, really close to families.” 

In 1995, Vilain left France for a faculty job 
at UCLA. There, he began tackling questions 
about sexual development from every possible 
angle. He created mouse models with muta- 
tions in SRY or other sex-linked genes to study 
how their developing brains respond to hor- 
mones — research that could lead to better care 
for people with DSDs.

Perhaps most notoriously, he has explored
the roots of sexual orientation, work that made 
even his colleagues uncomfortable. In 2006, 
he was looking to publish work by his postdoc 
Sven Bocklandt, who had found links between 
the way genes are expressed from a mother’s 
X chromosomes and the chances of her having 
a gay son. When he approached biostatisticians 
for help, several refused to collaborate, Vilain 
says, because they were afraid of how the public 
might respond. 

Studies on the genetic underpinnings of 
homosexuality are controversial. Religious con- 
servatives who believe that being gay is a choice 
argue that scientists are trying to legitimize it; 
gay activists worry that the research will lead to 
misguided attempts to ‘cure’ gay people. Vilain 
gets occasional attacks from both groups. But he 
says that his colleagues’ squeamishness around 
controversial research was unscientific. So, he 
stormed into the office of the UCLA biostatistics 
chair, Kenneth Lange, to complain. 

“Eric’s not afraid to kick up some dust and 
stand up for the people in his lab,” Bocklandt 
says. “I think that’s why he’s been so successful.” 
A statistician eventually volunteered to help. 

Dean Hamer, a retired geneticist formerly at 
the US National Cancer Institute in Bethesda, 
Maryland, trained Bocklandt and has studied 
the genetics of sexual orientation. He says that 
Vilain is pretty much the only geneticist who 
still does serious research on the topic. “That 
takes a level of courage and belief that ultimately 
the biology will win out,” he says. 


COURTING ADVOCACY 

Vilain’s research and interest in policy has put 
him on the front lines of the lesbian, 
gay, bisexual, transgender and queer 
(LGBTQ) rights movement and has 
made his lab a magnet for LGBTQ 
students. His work also made him 
a sort of scientist-laureate for the 
intersex advocacy community, 
which started gaining prominence 
in the early 1990s with the forma- 
tion of the Intersex Society of North 
America in Rohnert Park, Califor- 
nia. The group, founded by activist 
Bo Laurent, lobbied for recognition 
of intersex as a human condition 
rather than an affliction, and opposed infant 
surgery. 

Vilain, who met Laurent in 1997, says that 
she helped to shape his opinions on surgery 
and other topics that are important to intersex 
people, such as the stigma they face. Although 
Laurent and her colleagues were well informed 
and knowledgeable about the science of DSDs, 
they struggled to be heard in scientific conversa-
tions. “I think the view was that they were zeal-
ots,” Vilain recalls.

In 2005, several paediatric societies met in 
Chicago to draft a consensus statement on the 
management of intersex conditions — a still-
influential document that guides the standard
of care. Laurent attended the meeting hoping
to see the word hermaphrodite struck from 
the medical vocabulary. The term was not only 
offensive — it labelled a person rather than a 
disorder — it was also scientifically inaccurate 
because it suggested that the person had func- 
tioning male and female organs. 

Rather than being heard, Laurent recalls 
being sidelined. But Vilain, who headed the 
genetics working group, met with her in secret 
throughout the meeting, drafting a case to 
present to the group. They met stiff opposi- 
tion from medical doctors, who saw no reason 
for change, but their language was ultimately 
adopted in the final statement.

Over the years, Vilain continued to build a 
reputation as an ally to intersex people. In 2011, 
when he and psychologist David Sandberg at 
the University of Michigan in Ann Arbor began 
the ten-institution registry to track children 
with DSDs, ethicists and activists enthusiasti- 
cally joined its advisory board. Funded by the 
US National Institutes of Health, the Disorders 
of Sex Development Translational Research 
Network has enrolled more than 300 children, 
collecting medical records and blood samples 
and performing interviews to answer a variety 
of biological and psychological questions. 

Many of the advocates who joined as advisers 
had hoped that development of the network 
would lead to a denouncement of infant geni- 
tal surgery by revealing the damage that it can 
cause. “No one has demonstrated anything but 
harm,” says Anne Tamar-Mattis, legal director 
of the intersex advocacy group interACT in San 
Francisco, California. “Research that settles that 
question is useful.”




But the study has yet to do what advocates 
hoped. Sandberg, who heads the network's psy- 
chological research, has collected evidence that 
emotional and social support from the family 
is the most important contributor to the psy- 
chological and mental health of a child with a 
DSD. He suspects that it has an even greater 
impact than surgery. “I never question people's 
experiences,” Sandberg says of the activists who
believe that surgery is always harmful. “What 
I do question is whether they're generalizable.” 

One argument in favour of infant surgery is 
that a child could be psychologically scarred 
by growing up with intermediate genitalia, but 
there is little evidence for or against that. In rare
cases, surgery could help to prevent cancer.
Complete androgen insensitivity syndrome,
for instance, confers an increased risk of tes-
ticular cancer that can be lowered through sur-
gery. But Vilain points out that the risk before
puberty is very small, suggesting that surgery
could wait. 

Although few surgeons were willing to talk 
openly about infant genital surgery, some do 
argue that the fear of harm is overblown or at 
least outdated. Laurence Baskin, a paediatric 
urologist at the University of California, San 
Francisco, says that the days of “assigning gen- 
der” are long gone, because scientists no longer 
believe that a child can be made to be a boy or a
girl. Most DSDs can be diagnosed and the out- 
comes predicted; physicians use the diagnosis 
to advise parents on which gender the child 
is likely to identify with, he says. For instance, 
the most common cause for a DSD is congeni- 
tal adrenal hyperplasia — which can result in 
ambiguous genitalia for XX children. Between 
90% and 95% of people with the condition 
identify as female.

When asked about children with this disorder 
who ultimately do not identify as female, 
another paediatric urologist — who wished 
not to be named — argues that the process can 
be reversed. People have sex-change surgery as 
adults all the time, he says. 

Such arguments infuriate Tamar-Mattis. “If 
one time in 20 you’re cutting a little boy’s penis
off, is that a risk worth taking?” she says. 

Vilain doesn't think so, and doesn't generally 
recommend surgery to his patients. He says that 
in his experience, more parents are now choos- 
ing to delay surgery. 

But he and his collaborators on 
the longitudinal study are reluc- 
tant to condemn surgery outright 
— they prefer to approach each 
case individually and to consider 
the views of parents who may feel 
strongly about what is right for 
their child. 

This attitude helped to create the 
rift between the researchers and 
intersex advocates. At the end of 
2015, Dreger, who had served as 
the bioethicist for the longitudinal 
study, announced her resignation
in a blog post. “I can’t continue to help develop
‘conversations’ around ‘shared decision making’ 
that allow decisions to be made that I believe 
violate the most basic rights of these children,” 
she wrote. “I am fed up with being asked to be 
a sort of absolving priest of the medical estab-
lishment.”

Vilain was blind-sided by the post. “I was very 
saddened by this,” he says. “She’s a friend.” After
her departure, eight other advocates sent the 
study’s leaders a letter of resignation. 

Most would not comment on the record, but 
say that they were upset that the researchers 
were making decisions about which questions 
to pursue without sufficiently consulting them.
For example, advocates are also concerned
about the psychological impacts on children
from having their genitals photographed for the
purpose of diagnosis or to plan treatment. Some
say that Vilain became hostile in meetings. They
accuse him and Sandberg of putting research
interests ahead of human suffering.

The controversy surrounding female runner Caster Semenya’s participation in international competitions
helped lead to rule changes about who can compete in women’s events.

“We live in a community of people who have 
experienced the harms of these practices,” says 
Arlene Baratz, a radiologist who serves as medi-
cal adviser to a DSD support group and is one of
those who resigned from the study. She and oth- 
ers say that in their decades of work as advocates 
they have never been contacted by someone 
who was helped by surgery. 

Vilain says that he does talk to such patients 
in his practice, but because they are living happy 
lives, they have no reason to speak out. With- 
out data on outcomes, says Douglas Diekema, 
a medical ethicist at the Seattle Children’s 
Research Hospital Institute in Washington, it is 
impossible to weigh up whether surgery is over- 
all harmful, helpful or neutral for most people. 
“Good ethics requires good data,” he says.

But a legal battle in the United States could 
change medical practice before those data are in. 
Tamar-Mattis is one of the lawyers representing 
the family of a baby who underwent feminiz- 
ing surgery at 16 months old. The child, now 
11 years old, identifies as male, and his law- 
yers argue that South Carolina's Department 
of Social Services and the university that per- 
formed the surgery violated the child's rights. 

Intersex advocates are watching the case with 
great interest, because it could lay the ground-
work for future suits that could effectively
outlaw the procedures in the United States.

In January, the United Nations released a 
report saying that sex-assignment surgeries on 
infants “lead to severe and life-long physical and 
mental pain and suffering and can amount to 
torture and ill-treatment”. Vilain and Sandberg 
worry that the language could alienate doctors 
and parents alike. “You're basically calling doc- 
tors torturers when they’re doing something 
considered standard medical practice,” Vilain
says. He points out that few medical proce- 
dures are governed by law — physicians tend to 
operate according to guidelines and principles. 
“I’m not opposed to guidelines, I’m opposed to
things that completely alter medical practice in
an irreversible way,” he says. He and Sandberg
also worry that legal bans could drive infant
surgery underground. “Parents are scared. You
just don’t dictate to them and say get over it,”
Sandberg says.


TESTING PATIENCE 

Vilain’s expertise has plunged him into other 
controversies. One example is his involvement 
with the International Olympic Committee, 
which in 2011 revised its policy on athletes who 
identify as female but who have male sex organs 
or produce high levels of testosterone. 

The issue came to the fore in 2009 after 
18-year-old South African runner Caster 
Semenya, who identifies as female, was sub- 
jected to humiliating sex testing before being 
allowed to continue competing in the women’s 
category. 

To head off future problems, the medical 
advisory board, under Vilain’s leadership, drew
a bright line for the 2012 Olympics. People with
testosterone levels above 10 nanomoles per 
litre of blood could not participate in women's 
events, no matter how they identify. Exceptions 
are made only if the athletes can prove that they
are resistant to the effects of testosterone. 

Many activists and ethicists are furious 
about the policy. “It bears noting that athletes 
never begin on a fair playing field; if they were 
not exceptional in one regard or another, they 
would not have made it to a prestigious interna-
tional athletic stage,” wrote bioethicist Katrina
Karkazis from Stanford University in California
in a 2012 article lambasting the policy.

Even Vilain struggles to defend it on scien- 
tific grounds. Although women with DSDs 
that result in high testosterone levels are over- 
represented among Olympians, the hormone 
does not seem to directly impact their perfor- 
mance. “It is very imperfect,” he admits. “But 
if we don’t have a dividing line, then there is
no point in segregating sexes in sports.” (The 
policy has been temporarily suspended and is 
under review.) 

Some of Vilain’s detractors question how he 
can support a somewhat arbitrary call in this 
situation while requiring more evidence to 
condemn infant surgery. But sport, he argues, 
depends on rules and policies, whereas medi- 
cine relies on best-practice guidelines — and 
that is what he hopes to develop through 
research. 

He and his collaborators plan to continue 
the longitudinal study. The team has recruited 
a bioethicist, John Lantos of Children’s Mercy 
Hospital in Kansas City, to replace Dreger, and 
it still has some patient advocates involved. 
Vilain says that he is trying not to antago- 
nize anyone — the next iteration will include 
research on more questions that the partici- 
pants say are priorities, such as how to preserve 
fertility for people with DSDs and identifying 
cancer risks. 

Yet Vilain’s experiences with patient advo- 
cates have hardened him somewhat. “I call the 
ones who work with us advocates; those against 
us activists,” he says. He remains driven by ques- 
tions about sex, even if it kicks up dust. “We're 
trying to listen to the community, but by the 
same token we're committed to producing data 
and evidence.”


Sara Reardon writes for Nature from 
Washington DC.

As cyberattacks grow ever more sophisticated, those who defend
against them are embracing behavioural science and economics to 
understand both the perpetrators and their victims. 


BY M. MITCHELL WALDROP 


Say what you will about cybercriminals, says Angela
Sasse, “their victims rave about the customer service”.

Sasse is talking about ransomware: an extortion
scheme in which hackers encrypt the data on a user’s com-
puter, then demand money for the digital key to unlock
them. Victims get detailed, easy-to-follow instructions for
the payment process (all major credit cards accepted), and
how to use the key. If they run into technical difficulties,
there are 24/7 call centres.

“It’s better support than they get from their own Internet
service providers,” says Sasse, a psychologist and computer 
scientist at University College London who heads the 
Research Institute in Science of Cyber Security. That, she 
adds, is today’s cybersecurity challenge in a nutshell: “The 
attackers are so far ahead of the defenders, it worries me 
quite a lot.” 

Long gone are the days when computer hacking 
was the domain of thrill-seeking teenagers and college
students: since the mid-2000s, cyberattacks have become
dramatically more sophisticated. Today, shadowy, state- 
sponsored groups launch exploits such as the 2014 hack of 
Sony Pictures Entertainment and the 2015 theft of millions 
of records from the US Office of Personnel Management, 
allegedly sponsored by North Korea and China, respec-
tively. ‘Hacktivist’ groups such as Anonymous carry out
ideologically driven attacks on high-profile terrorists and 
celebrities. And a vast criminal underground traffics in 
everything from counterfeit Viagra to corporate espio- 
nage. By one estimate, cybercrime costs the global economy 
between US$375 billion and $575 billion each year.

Increasingly, researchers and security experts are real- 
izing that they cannot meet this challenge just by build- 
ing higher and stronger digital walls around everything. 
They have to look inside the walls, where human errors, 
such as choosing a weak password or clicking on a dodgy 
e-mail, are implicated in nearly one-quarter of all cyber-
security failures. They also have to look outwards, tracing
the underground economy that supports the hackers and 
finding weak points that are vulnerable to counterattack. 

“We've had too many computer scientists looking at 
cybersecurity, and not enough psychologists, economists 
and human-factors people,” says Douglas Maughan, head
of cybersecurity research at the US Department of Home- 
land Security. 

That is changing — fast. Maughan’s agency and other 
US research funders have been increasing their spending 
on the human side of cybersecurity for the past five years 
or so. In February, as part of his fiscal-year 2017 budget 
request to Congress, US President Barack Obama proposed 
to spend more than $19 billion on federal cybersecurity 
funding — a 35% increase over the previous year — and 
included a research and development plan that, for the first 
time, makes human-factors research an explicit priority. 

The same sort of thinking is taking root in other countries. 
In the United Kingdom, Sasse’s institute has a multiyear, 
£3.8-million (US$5.5-million) grant from the UK govern- 
ment to study cybersecurity in businesses, governments 
and other organizations. Work from the social sciences is 
providing an unprecedented view of how cybercriminals 
organize their businesses — as well as better ways to help 
users to choose an uncrackable yet memorable password. 

The fixes are not easy, says Sasse, but they’re not impos- 
sible. “We've actually got good science on what does and 
doesn't work in changing habits,” she says. “Applying those 
ideas to cybersecurity is the frontier.” 


KNOW YOUR AUDIENCE 

Imagine that it is the peak of a harried work day, and a 
legitimate-looking e-mail lands in your inbox: the com- 
pany’s computer team has detected a security breach, it says, 
and everyone needs to run an immediate background scan 
for viruses on their machines. “There’s a tendency to just 
click ‘accept’ without reading,” says Adam Joinson, a social 
psychologist who studies online behaviour at the University 
of Bath, UK. Yet the e-mail is a fake — and that hasty, exas- 
perated click sends malware coursing through the company 
network to steal passwords and other data, and to convert 
everyone's computers into a zombie ‘botnet’ that fires off 
more spam. 

The attackers, it seems, have a much better grasp on user 
psychology than have the institutions meant to defend 
them. In the scenario above, the success of the attack relies 
on people’s instinctive deference to authority and their 
lowered capacity for scepticism when they’re busy and
distracted. Companies, by contrast, tend to impose security
rules that are disastrously out of sync with how people 
work. Take the ubiquitous password, by far the simplest 
and most common way for computer users to prove their 
identity. One study, released in 2014 by Sasse and others,
found that employees of the US National Institute of Stand- 
ards and Technology (NIST), headquartered in Gaithers- 
burg, Maryland, averaged 23 ‘authentication events’ per day 
— including repeated logins to their own computers, which 
locked them out after 15 minutes of inactivity. 

Such demands represent a substantial drain on 
employees’ time and mental energy — especially for 
those who try to follow the standard password guidelines. 
These insist that people use a different password for each 
application; avoid writing passwords down; change them 
regularly; and always use a hard-to-guess mix of symbols, 
numbers and uppercase and lowercase letters. 

So people resort to subversion. In another systematic 
study of password use in the real world, Sasse and her
colleagues documented the ways in which workers at a 
large multinational organization side-stepped the official 
security requirements without (they hoped) being totally 
reckless. The employees’ methods — writing down a list of 
passwords, for example, or transferring files between com- 
puters using unencrypted flash drives — would be familiar 
in most offices, but essentially created a system of ‘shadow
security’ that kept the work flowing. “Most people’s goal is
not to be secure, but to get the job done,” says Ben Laurie, 
who studies security compliance at Google Research in 
London. “And if they have to jump through too many 
hoops, they will say, ‘To hell with it.’”

Researchers have uncovered multiple ways to ease this 
impasse between workers and security managers. Lorrie 
Cranor directs the CyLab Usable Privacy and Security Labo- 
ratory at Carnegie Mellon University in Pittsburgh, Pennsyl- 
vania — one of several groups worldwide that are looking at 
ways to make password policies more human-compatible. 

“We got started on this six or seven years ago, when Car- 
negie Mellon changed its password policy to something 
really complicated,” says Cranor, who is currently on leave 
from the university to serve as chief technologist at the US 
Federal Trade Commission in Washington DC. The univer- 
sity said that it was trying to conform to standard password 
guidelines from NIST. But when Cranor investigated, she 
found that these guidelines were based on educated guesses. 
There were no data to base them on, because no organization
wanted to reveal its users’ passwords, she says. “So we
said, ‘This is a research challenge.’”

TOP 10 MOST COMMON PASSWORDS, 2015

1. 123456
2. password
3. 12345678
4. qwerty
5. 12345
6. 123456789
7. football
8. 1234
9. 1234567
10. baseball

SOURCE: SPLASHDATA (GO.NATURE.COM/BIXMEK)

Cranor and her colleagues put a wide range of password
policies to the test by asking 470 computer users at
Carnegie Mellon to generate new passwords based on different
requirements for length and special symbols. Then they 
tested how strong the resulting passwords actually were, 
how much effort was required to create them, how easy 
they were to remember — and how annoyed at the system 
the participants became. 

One key finding was that organizations should forget the
standard advice that complex gobbledygook words such as 
0s7G0*7j%x$a are safest. “It’s easier for users to deal with
password length than password complexity,” says Cranor.
An example of a secure but user-friendly password might 
be a concatenation of four common but randomly chosen 
words — something like usingwoodensuccessfuloutline. At 
28 characters, it is more than twice as long as the gibber- 
ish example, but much easier to remember. As long as the 
system guards against people making stupid choices such as 
passwordpassword, says Cranor, strings of words are quite 
hard for attackers to guess, and provide excellent security. 
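
The word-string idea can be sketched in a few lines of Python. This is an illustration only, not the method used in the Carnegie Mellon study: the tiny WORDS list and the function names make_passphrase and is_repeated_word are assumptions made for the example, and a real system would draw from a list of thousands of words.

```python
import secrets

# Hypothetical mini word list for illustration only; a real system
# would use a list of several thousand common words.
WORDS = ["using", "wooden", "successful", "outline", "garden", "river",
         "silver", "window", "market", "planet", "yellow", "simple"]

def make_passphrase(n_words=4, words=WORDS):
    """Join n_words distinct words chosen with a cryptographic RNG."""
    pool = list(words)
    chosen = []
    for _ in range(n_words):
        word = secrets.choice(pool)  # secure random choice
        pool.remove(word)            # no word is used twice
        chosen.append(word)
    return "".join(chosen)

def is_repeated_word(passphrase):
    """Flag strings built from one chunk repeated, e.g. 'passwordpassword'."""
    n = len(passphrase)
    return any(n % k == 0 and passphrase == passphrase[:k] * (n // k)
               for k in range(1, n // 2 + 1))

gibberish = "0s7G0*7j%x$a"
example = "usingwoodensuccessfuloutline"
print(len(gibberish), len(example))          # 12 28
print(is_repeated_word("passwordpassword"))  # True
print(make_passphrase())                     # four random words, joined
```

The length comparison reproduces the figures in the text: 12 characters for the gibberish string against 28 for the four-word example.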


TIME FOR A CHANGE 

Another key finding, says Cranor, is that unless there is 
reason to think that the organization's security has been 
compromised, the standard practice of forcing users to 
change their passwords on a 30-, 60- or 90-day schedule 
ranks somewhere between useless and counterproductive 
(see go.nature.com/2vq6r4). For one thing, she says, studies
show that most people respond to such demands by
choosing a weaker password to begin with, so that they can
remember it, and then making the smallest change that they
can get away with. They might increase a final digit by one,
for example, so that password2 becomes password3 and so
on. “So if a hacker guesses your password once,” she says,
“it won’t take them many tries to guess it again.”
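
The pattern is simple enough to detect mechanically, which is also why it is simple to guess. The sketch below shows one way to spot a new password that differs from the old one only in its trailing digits. The function names split_trailing_digits and is_trivial_rotation are hypothetical, chosen for this example; nothing here comes from the studies cited.

```python
import re

def split_trailing_digits(password):
    """Split a password into (stem, trailing digit string)."""
    match = re.fullmatch(r"(.*?)(\d*)", password, flags=re.DOTALL)
    return match.group(1), match.group(2)

def is_trivial_rotation(old, new):
    """True if new differs from old only in its trailing digits."""
    old_stem, _ = split_trailing_digits(old)
    new_stem, _ = split_trailing_digits(new)
    return old != new and old_stem == new_stem

print(is_trivial_rotation("password2", "password3"))                     # True
print(is_trivial_rotation("password2", "usingwoodensuccessfuloutline"))  # False
```

An attacker who already knows password2 needs only a handful of guesses to cover every password that this check would flag.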

Besides, she says, one of the first things that hackers do
when they break in is to install a key-logging program or
some other bit of malware that allows them to steal the new
password and get in whenever they want. So again, says
Cranor, “changing the password doesn’t help”.

Sasse sees encouraging signs that such critiques are being 
heard. “For me, the milestone was last year when GCHQ 
changed its advice on passwords,” she says, referring to the
Government Communications Headquarters, a key UK
intelligence agency. GCHQ issued a public document,
containing several citations to the research literature, that
gave up on long-established practices such as demanding
regular password changes, and instead urged managers to
be as considerate as possible towards the people who have
to live with their policies. “Users have a whole suite of
passwords to manage, not just yours,” goes one bit of advice.
“Only use passwords where they are really needed.” 


ATTACK THE ATTACKERS 
If research can uncover weak points in user behaviour, 
perhaps it can also find vulnerabilities among the attackers. 

In 2010, Stefan Savage, a computer scientist at the University
of California, San Diego, and his team set up a cluster of
computers to act as what he calls “the most gullible consumer 
ever”. The machines went through reams of spam e-mails col- 
lected from several major antispam companies, and clicked 
on every link they could find. The researchers focused on 
illegal pills, counterfeit watches and handbags, and pirated 
software — three of the product lines most frequently adver- 
tised in spam — and bought more than 100 items. Then they 
used specially designed web-crawling software to track back 
through the spammers’ supply network. If an illicit vendor 
registered a domain name, made payments to a supplier or 
used a bank to accept credit-card payments, the researchers 
could see it. The study exposed, for the first time, the entire 
business structure of computer criminals — and revealed 
how surprisingly sophisticated it was. 

“It was the ultimate hothouse of weird new entrepreneurial
ideas,” says Savage, “the purest form of small-business
capitalism imaginable — because there is no regulation.” 
Yet there was order, even so. “Say you have a criminal activ- 
ity you want to engage in,” Savage explains — for exam- 
ple, selling counterfeit drugs. You set up shop by creating 
the website and the databases, striking a deal with a bank 
to accept credit-card payments and creating a customer- 
service arm to deal with complaints — all the back-end 
parts of the business (see ‘A tangled web’).

“You don’t send the spam yourself,” says Savage. “You
open that up to affiliates” — specialists who know how to 
send reams of clickable messages that fool people's spam fil- 
ters. “They get 30-40% of the purchase price for any order 
they bring to you,” he says. And if the brilliant idea turns out
to be a dud — well, they just go and spam for someone else.

This affiliate business model has been confirmed in subsequent
studies by Savage and many others, and turns out
to apply to a broad range of cybercrimes, from the sale of 
knock-off handbags to ransomware, credit-card piracy and 
other forms of cybertheft. All are supported by the same 
underground economy of affiliate services, of which spam 
generation is only one. Others range from companies in 
India where people spend their days typing in characters 
from CAPTCHA symbol-recognition tests — thus ‘proving’ 
that a malicious program is human — to an up-and-coming 
spam alternative known as search-engine poisoning, in 
which people who click on legitimate-looking search results 
are redirected to malicious websites. 

Unfortunately for law-enforcement agencies, tracing the 
structure of this underground economy rarely helps them 
to arrest the individuals involved; real-world identities tend 
to be closely guarded behind online pseudonyms. And in 
any case, the criminal cyberinfrastructure is remarkably 
resilient. In October 2013, for example, the FBI managed to 
shut down Silk Road, an eBay-like website that linked buy- 
ers and sellers for illicit commodities including hard drugs. 
Silk Road 2.0 appeared online a month later. And when the 
FBI shut down that site in late 2014, still others popped up. 

However, researchers have uncovered some potentially 
more effective ways to attack the underground economy. 
Savage and his colleagues found that by far the weakest
links were the banks that processed credit-card payments
to the profit centres. They were at the mercy of the credit- 
card companies, whose contracts generally state that any 
bank that represents a merchant must guarantee that a 
sale is legal — and is liable for paying customers back if 
they complain. Few banks were willing to take such risks. 
“It turned out that 95% of counterfeit spam on the planet
went through just three banks,” says Savage: one each in 
Azerbaijan, Latvia and St Kitts and Nevis. In November 
2011, Microsoft worked with Visa to pressure those banks 
to drop the vendors that were pirating its products. “And for 
18 months,” says Savage, “there was no one selling pirated 
Microsoft software on the Internet.” 

It was not a permanent solution, however: banking sup- 
port for the shady software vendors eventually moved to 
east Asia, where Western companies and law-enforcement 
agencies have considerably less leverage. Still, the hope is 
that continuing research will be able to make a big dif- 
ference in the long run. “For the first time,” says Nicolas 
Christin, a computer engineer who studies the human side 
of cybersecurity at Carnegie Mellon, “we have vast amounts 
of data about the underground economy.”

Try that in the real world, says Christin: anyone who 
wanted to understand, say, the street-drug trade in Pitts- 
burgh would have to go undercover and risk getting killed. 
And even then, they would get only a fragmentary, ad hoc 
glimpse of the whole picture. 

But in the online world, every transaction leaves a digital 
trail, says Christin, who has led research on Silk Road — 
especially when payments are made using digital currencies 
such as Bitcoin. “And for an economist, that’s wonderful.”
Christin and others in this field have watched criminal 
systems grow, mature and be taken down — and others 
spring up in their place. They have watched coalitions form 
and dissolve, and tracked how the flow of money between 
criminals helps them to build trust. 

“We’re just beginning to scratch the surface on the
analysis,” says Christin. But he foresees this flood of data
resulting in a new fusion of computer science with social 
science and conventional law enforcement. “It may actu- 
ally be very fruitful ground for refining and testing existing 
theories of criminal behaviour,” he says. 

Savage has a similar hope. Whether the focus is inward- 
or outward-looking, he says, “There is so much snake oil 
around security. Very few decisions are based on data.” 
Continuing research could help people to base more of 
those decisions on evidence, he says. “But to do that, you 
have to look at the people involved — their motivations 
and incentives.”


M. Mitchell Waldrop is a features editor for Nature in 
Washington DC. 
ANNA TARNHUVUD


Human-embryo research is governed by a policy that aims to accommodate diverse moral concerns. 


Revisit the 14-day rule 


Studies of human development in vitro are on a collision course with an 
international policy that limits embryo research to the first two weeks of 
development, warn Insoo Hyun, Amy Wilkerson and Josephine Johnston. 


On 4 May, two groups reported that
they had sustained human embryos
in vitro for 12-13 days. Embryos
normally implant in the wall of the uterus
at around day seven. Until now, no one had
reported culturing human embryos in vitro
beyond nine days, and rarely have they been
sustained for more than seven.
This latest advance comes only
21 months after researchers at the
Rockefeller University in New York City
(some of whom are involved in the latest
embryo-culturing work) announced that,
under certain conditions, individual human
embryonic stem cells can self-organize into
structures akin to the developmental stages
of embryos soon after implantation (see
‘Two advances in human developmental
biology’). The cells were obtained from
pre-existing stem-cell lines (derived from
4–5-day-old embryos donated through
fertility clinics).

© 2016 Macmillan Publishers Limited. All rights reserved.

In principle, these two lines of research 
could lead to scientists being able to study 
all aspects of early human development 
with unprecedented precision. Yet these 
advances also put human developmental 
biology on a collision course with the
‘14-day rule’ — a legal and regulatory line in 
the sand that has for decades limited in vitro 
human-embryo research to the period 
before the ‘primitive streak’ appears. This 
is a faint band of cells marking the begin- 
ning of an embryo’s head-to-tail axis. 

The 14-day rule has been effective for 
permitting embryo research within strict 
constraints — partly because it has been 
technologically challenging for scientists to 
break it. Now that the culturing of human 
embryos beyond 14 days seems feasible, 
more clarity as to how the rule applies to 
different types of embryo research in dif- 
ferent jurisdictions is crucial. Moreover, in 
light of the evolving science and its poten- 
tial benefits, it is important that regula- 
tors and concerned citizens reflect on the 
nature of the restriction and re-evaluate its 
pros and cons. 


POLICY TOOL 

The 14-day limit was first proposed in
1979 by the Ethics Advisory Board of the
US Department of Health, Education,
and Welfare. It was endorsed in 1984 by
the Warnock committee in the United
Kingdom, and in 1994 by the US National
Institutes of Health’s Human Embryo
Research Panel.

In at least 12 countries, this limit is 
encoded in laws governing assisted reproduc- 
tion and embryo research (see ‘International 
agreement’). The rule is also embodied in 
numerous reports commissioned by gov- 
ernments, and in scientific guidelines for 
embryo and assisted-reproduction research. 
These include China’s 2003 Ethical Guiding 
Principles on Human Embryonic Stem Cell 
Research and India’s 2007 Guidelines for 
Stem Cell Research and Therapy. 

Some versions of the rule cover embryos 
created by any means; others apply only to 
products of fertilization. Some explicitly 
refer to gastrulation (when three differ- 
ent cell layers appear) or the formation of 
the primitive streak; others mention only 
the 14 consecutive days of development. 
In most cases, however, what seems to be 
crucial is the stage of development that 
the 14th day typically represents, not the 
consecutive number of days in culture. 

The formation of the primitive streak is 
significant because it represents the ear- 
liest point at which an embryo’s biologi- 
cal individuation is assured. Before this 
point, embryos can split in two or fuse 
together. So some people reason that at 
this stage a morally significant individual 
comes into being. 

Yet views differ on the moment in development
at which a human embryo obtains
sufficient moral status that research on it
should be prohibited. Some, for instance,
believe that the cut-off is the point of
fertilization; others argue that it comes much
later, when the embryo develops into a fetus
that can experience pain, exhibit brain activity
or survive outside the womb.

170 | NATURE | VOL 533 | 12 MAY 2016

Revisiting the 14-day rule might tempt 
people to try to rationalize or attack the 
philosophical coherence of the limit as an 
ethical tenet grounded in biological facts. 
This misconstrues the restriction. The 
14-day rule was never intended to be a 
bright line denoting the onset of moral sta- 
tus in human embryos. Rather, it is a public- 
policy tool designed to carve out a space for 
scientific inquiry and simultaneously show 
respect for the diverse views on human- 
embryo research. 

In fact, as a public-policy instrument, 
the 14-day rule has been tremendously 
successful. It has offered a clear and legally 
enforceable stopping point for research, 
because the primitive streak can be visibly
identified and it is possible to count
the number of days that an embryo has
been cultured in a dish. The alternatives at 
each extreme — banning embryo research 
altogether or imposing no restrictions on 
embryo use — would not have made for 
good public policy in a pluralistic society. 


TWO GOALS 

Scientific advances are now prompting 
re-evaluations of other long-established 
research policies. For instance, it has 
proved difficult to maintain a previous 
consensus among funders, regulators and 
researchers that genetic engineering of 
human cells is permissible as long as those 
cells are not sperm, eggs or embryos. The 
clinical use of mitochondrial-replacement 
therapies — which cause heritable changes 
to future generations — was approved last 
year by the UK government, and deemed 
‘ethically permissible’ earlier this year by a 
committee of the US Institute of Medicine. 


STREAKING AHEAD 


Two advances in human developmental biology 


In 2014, researchers at the Rockefeller
University in New York City placed human
embryonic stem cells on plastic discs with
patterned surfaces designed to support
cell clustering, and treated the cells with
a bone growth factor. In one or two days,
the cells had arranged themselves into
radially symmetric patterns. These mirror
— in flattened form — the organization
of embryos soon after implantation in the
uterine wall. From the outside in, concentric
circles of cells form each of the three germ
layers that give rise to all fetal tissues: the
endoderm, mesoderm and ectoderm cells.
These in vitro models even show evidence of
primitive-streak-like regions.

These self-organizing structures,
although embryo-like, are essentially
two-dimensional. Other in vitro models
have demonstrated some degree
of self-organization when human
embryonic stem cells are cultured in 3D
environments.

[Image: Human embryonic stem cells form
self-organized spatial patterns.]

All these differ from implanted human 
embryos resulting from a fertilized egg 
in several ways. But it is plausible that 
researchers could one day create more 
comprehensive 3D models.

This week, teams led by researchers 
at the Rockefeller University and the 
University of Cambridge, UK, report that
they cultured intact human embryos in vitro 
and obtained interpretable images up to 
day 12-13. Each team stopped its studies 
by day 14 in accordance with UK law and 
international guidelines. 

The latest work provides compelling 
insight into how the early human embryo 
transitions from a floating hollow ball of 
cells to a three-layered gastrula attached 
to the uterus. Studies on human embryos 
sustained in culture could help to clarify 
whether self-organizing structures truly 
mimic early human development. Tracking, 
in real time, the morphological and 
molecular changes in embryonic cells and 
the interactions between them during these 
later days of development, could elucidate 
the cell-signalling pathways that guide 
embryo organization and tissue formation. 

These techniques could shed light on 
the disorders that result in early pregnancy 
losses and birth defects, and facilitate 
clinical applications of stem-cell research. 
In conjunction with gene-editing tools, they 
could even help to determine the role of 
specific genes in human development. 


REF. 5 


Some might conclude from such 
developments that policymakers redefine 
boundaries expediently when the limits 
become inconvenient for science. If restric- 
tions such as the 14-day rule are viewed as 
moral truths, such cynicism would be war- 
ranted. But when they are understood to be 
tools designed to strike a balance between 
enabling research and maintaining public 
trust, it becomes clear that, as circumstances 
and attitudes evolve, limits can be legiti- 
mately recalibrated. 

Any decision to revise the 14-day rule 
must depend, however, on how well any 
proposed changes can uphold the rule’s 
two chief goals: supporting research and 
accommodating diverse moral concerns. 

The rule became a standard part of 
embryo-research oversight through the 
convergence of deliberations of various 
national committees over decades. Hun- 
dreds of medical and scientific associations 
submitted recommendations, and dozens 
of public forums were held. Any formal 
changes to this rule should occur through 
similar processes of consensus-building 
involving experts, policymakers, patients 
and concerned citizens. 

Ideally, discussion should begin at 
an international level given the global 
nature of this research — although tak- 
ing local cultural and religious differences 
into account properly would also require 
national-level debates. A complication is 
that in many countries, a revision to the 
14-day rule would involve a legislative 
change. Yet the kind of international dis- 
course that we envision could facilitate 
and inform local decisions to amend law 
or research policy. 

There are precedents for this type of 
international discourse. In response to 
the development of powerful gene-editing 
technologies such as CRISPR-Cas9, the 
US National Academy of Sciences, the US
National Academy of Medicine, Britain’s 
Royal Society and the Chinese Academy 
of Sciences jointly hosted an international 
summit in December last year to discuss sci- 
entific, ethical and governance issues raised 
by the research. The second component of 
this initiative — a science and policy review 
and report on human gene editing — is 
ongoing. 


A PATH FORWARD 

Scientists have a crucial part to play in 
this process. In 1985, when the legality of 
human-embryo research in the United 
Kingdom was threatened by a parliamen- 
tary bill, Nature editors appealed to embry- 
ologists to submit explanations of their 
research and its importance — to educate 
policymakers and the public before undue 
restrictions on research were passed (see 
Nature 314, 11; 1985). 


INTERNATIONAL AGREEMENT

Twelve countries (dark blue) have laws that restrict in vitro research on human embryos to within
the first 14 days of development. In five others (pale blue), nationally commissioned scientific
guidelines specify the 14-day rule. Guidelines issued by the International Society for Stem Cell
Research for the global research community also articulate the 14-day rule.

[Map legend: dark blue, encoded in law; pale blue, specified in scientific guidelines.
A map note survives only in part: “… cultures to seven days.”]

Today, researchers of human developmen- 
tal biology should similarly engage with the 
public about what they are doing and why it 
matters. And they should consider design- 
ing their experiments in a way that, while 
furthering discovery, also addresses people's 
moral concerns. 

In the immediate future, researchers
should work closely with their local
research-oversight committees to ensure
that they are not at risk of violating
current laws or guidelines. There
are currently ambiguities around the
legal definition of ‘human embryo’
in some jurisdictions, and uncertainties
around the biological potential of
self-organizing, embryo-like structures.

Next week, the International Society for 
Stem Cell Research (ISSCR) will release its 
revised guidelines for stem-cell research. 
These guidelines are the result of a multi- 
national, interdisciplinary task force (which 
included one of us, I.H.) with input from 
stakeholders around the world. One of the 
goals of these guidelines is to provide a 
framework for those concerned about how 
research oversight should proceed in light 
of new forms of embryo research. 

In the short term, we think that the 
ISSCR’s recommended approach to over- 
sight of work involving human embryos 
offers a practical path forward — espe- 
cially if supplemented with input from 
representatives of the many advisory com- 
mittees that have adopted the 14-day rule. 
Obvious candidates are the UK Human 
Fertilisation and Embryology Authority, 
the US National Academies of Sciences, 
Engineering, and Medicine, and the 
Chinese Ministry of Science and Technol- 
ogy and Ministry of Health. 

Close collaboration between these 
organizations could help to prevent a 
public backlash and the implementation 
of reactive, more restrictive limits on 
research.

SEE NEWS & VIEWS P.182

David Attenborough in one of his early natural-history documentaries, Zoo Quest, more than 60 years ago.


Restore our 
sense of species 


Klaas-Douwe B. Dijkstra has named a new
dragonfly after David Attenborough to mark the
broadcaster’s 90th birthday — and to honour the
importance of knowing the natural world.
Many of the greatest communicators
have been naturalists. E.O. Wilson
has his ants and Jared Diamond
his birds. Oliver Sacks studied ferns as well 
as brains. The views of these thinkers were 
opened up by their exploration of the bio- 
sphere, enabling them to expose that vision 
to the world. One of the greatest natural- 
ists of all is not a scholar in the traditional 
sense, but a broadcaster. To mark the 90th 
birthday on 8 May of documentary pioneer 
David Attenborough, I was given the hon- 
our of naming a new species of dragonfly — 
his favourite insect — for him (see ‘David's 
dragonfly’). 

Attenborough’s many hundreds of hours 
of exquisite television are a reminder that a 
primary reason to study nature is enlight- 
enment. I am, of course, not the first to 
name a species for him — at least 15 ani- 
mals and plants now bear his name. But 
doing so has led me to reflect on why we 
need naturalists like Attenborough today 
more than ever. 

Although the impact of humans on all 
other life is beyond apocalyptic, our con- 
sciousness of its diversity is medieval. 
Almost 3 centuries since Carl Linnaeus came 
up with his naming system, perhaps only 
1.2 million of an estimated 8.7 million extant 
eukaryote species (animals, plants, fungi and 
protists) are named, and more than half of 
them are insects such as dragonflies. Even
worse than this taxonomic deficit is our 
grasp of natural history. The basic ecology, 
distribution and status of only 80,000 species 
are known well enough to assess their extinction
risk, and about 29% are in danger
(www.iucnredlist.org).

The scale of this incredible richness, 
ignorance and destruction is hard to 
fathom. Imagine for a second that each of 
the estimated 6.5 million terrestrial species 
had an equal share of the total available land 
on Earth. Each species’ plot would cover an 
area only one-quarter the size of Manhattan, 
so the human plot could be walked around 
in just three hours. The 80,000 species we 
are familiar with would only cover an area 
equal to Spain, France and Turkey; and we 
would not even know the natural world 
beyond the combined areas of Europe, India 
and China. Yet we are set to void of life an 
area equivalent to the New World. The bio- 
sphere has been charted as well today as 
the globe was in Christopher Columbus’s 
day, yet the biological apocalypse is already 
complete. 
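
The arithmetic behind this thought experiment can be checked with a short sketch. The species counts are the ones quoted above. The land area of about 149 million square kilometres and the walking pace of 6 kilometres per hour are assumptions added here for illustration, and the essay’s own figure for ‘total available land’ may differ.

```python
import math

# Figures quoted in the essay
TERRESTRIAL_SPECIES = 6_500_000   # estimated terrestrial species
ASSESSED_SPECIES = 80_000         # species with known extinction risk

# Assumptions made for this sketch (not given in the essay)
LAND_AREA_KM2 = 149_000_000       # approximate land surface of Earth
WALKING_SPEED_KMH = 6.0           # brisk walking pace

plot_km2 = LAND_AREA_KM2 / TERRESTRIAL_SPECIES   # equal share per species
side_km = math.sqrt(plot_km2)                    # treat the plot as a square
walk_hours = 4 * side_km / WALKING_SPEED_KMH     # time to walk the perimeter
assessed_area_km2 = ASSESSED_SPECIES * plot_km2  # land 'known' to us

print(f"plot per species: {plot_km2:.1f} km2")
print(f"walk around it:   {walk_hours:.1f} hours")
print(f"assessed species: {assessed_area_km2 / 1e6:.2f} million km2")
```

Under these assumptions each plot is roughly 23 square kilometres, takes a little over three hours to walk around, and the 80,000 assessed species together cover about 1.8 million square kilometres.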

After spending 32 of my 41 years in 
the field, I’m still agog at life’s splendour. 
The beauty I see as I search for dragon- 
flies on expeditions in Gabon or watch 
birds around Stellenbosch on my way to 
work can be so absurd it makes me laugh, 
so diverse it makes me gasp for air, so 
intense that I binge-watch as if there is a
cliff-hanger in every impression.

Image credits: DAVID ATTENBOROUGH/BBC; ERLAND R. NIELSEN

Last year, my colleagues and I described 
60 new dragonfly species at once, adding 
1 species to every 12 known in Africa.
Why? To show that most of what is 
unknown, however conspicuous, is sim- 
ply not looked for. The field is empty while 
the labs are full. A student I know, who is 
passionate about exploring beetles — the 
most varied animal group on Earth — has 
ended up studying the gene expression of 
one model species in a lab. 

Intact biodiversity provides undeniable 
proof that we can inhabit our environment 
without destroying it. Just when naturalist- 
taxonomists are needed most to expose the 
evidence, their position has become weak. 
We in the field are partly to blame for this 
marginalization, having too often empha- 
sized the scientific and economic value of 
what we do, thus losing sight of its impact 
beyond science and the economy. Our work's 
greatest justification lies not in biodiversity’s 
enormous direct contribution to human 
well-being, but in the moral counterweight 
that we can offer to life’s runaway exploi- 
tation: biodiversity is the embodiment of 
sustainability. 


CORE VALUES 

Attenborough once said that he had 
“never met a child who was not interested 
in natural history”. For most of our exist- 
ence, humans were hunter-gatherers who 
needed to name and know other species 
to survive. We evolved an affinity with 
nature — what Wilson calls biophilia. This 
is probably why observing nature can be so 
satisfying. If nurtured, this instinct could 
rapidly transform society.

The core value of natural history and 
taxonomy is species sense. This is a con- 
sciousness of the existence and impact of all 
species, from plankton to cattle and includ- 
ing humankind. Species make each place 
special and thus worth fighting for. Life is 
like water, a branching river system literally 
a genealogy: with a unique history in every 
separate stream, one human action can 
erase an irreplaceable ecosystem. Although
a hydroelectric dam might seem a sapient
energy solution, it can mean ecocide to a
‘specient’ mind — one with species sense.

Whereas every human relies on this 
species sense, even if only by reaping the 
benefits of agriculture and medicine, few in 
society see it as their primary responsibil- 
ity. With nature held hostage by our grow- 
ing demands, environmental consultants 
and conservationists have little time left to 
find out who they work for. 

Most worryingly, the field of biology itself 
has lost species sense. Biological research 
is an interaction between the inventory 
of life’s diversity and the investigation 
of the forces shaping it. Disciplines shift 


DAVID’S DRAGONFLY 


Field and museum research reveal Madagascan beauty 


Dragonflies do not help to feed us like 
bees and fish do; they are not feared and 
persecuted like mosquitoes and snakes; 
nor are they studied as proxies of human 
psyche and society like ants and apes. 
Their beauty and sensitivity stand for the 
state and needs of nature before our own. 
We admire dragonflies purely for what 
they are — the same unconditional love 
for nature that David Attenborough has 
taught us. 

In few places is the creative force 
of nature and the destructive force of 
humankind more apparent than in 
Madagascar. Fortunately, the dragonfly 
Acisoma attenboroughi¹⁰ (pictured) can
be seen easily there. Kai Schütte and I
first noticed in the field and in collections
that this species had been confused 
with its African and Asian counterparts 
for 174 years — a fact confirmed by 
DNA studies in the molecular labs at the 
museums of Leiden in the Netherlands and 
Hamburg in Germany. 

The photographer Erland Nielsen 
joined me on a special tour for dragonfly 
enthusiasts earlier this year to amass 
images and raise funds for a booklet 
introducing these freshwater sentinels 
to the people of Madagascar for the first 
time. Like so many species, Madagascar’s 
spectacular dragonflies have been ignored 
since the European monographs produced 
in bionomy’s heyday in the 1950s. K.-D.B.D.


Attenborough’s pintail (Acisoma attenboroughi). 


with the advance of theory and methods 
(genetics to genomics is one example). 
Spread across all ranks of life, specializa- 
tions such as entomology and botany are 
more stable but also more isolated and 
introverted. Woven together, information 
and theory make biology strong. 
However, when competition for finan- 
cial support increased half a century ago, 
the seemingly static soloists stood weaker. 
Somehow, the idea that they were old-fash- 
ioned and lacked rigour and impact became 
accepted, thinning the warp of expertise 
that bore the weft of disciplines in biology’s 
fabric⁵. The strongest ‘despeciation’ of biology
occurred in the 1960s to 1980s. Within
a 40-year period in the United States, text- 
book content related to natural history 
decreased from two pages of every three to 
just one page; related PhDs fell from two in
five to one in five (even as the total number 
in biology tripled); and the median number 
of courses on natural history required for 
a biology bachelor’s degree dropped from 
two to zero⁶.

This unravelling has also affected the 
ultimate custodians of species sense: natural 
history museums. With decreasing support, 
these have often increased emphasis on the 
separate outcomes of their taxonomic exper- 
tise without reinforcing the foundation itself. 
Public outreach must draw bigger crowds; 
scientists must chase loftier questions; and 
collections must focus on preservation. As 
a result, collections-based research has lost 
ground⁷.

The species expertise from such research 
may be the most impactful knowledge
of biodiversity, used by enthusiasts and
practitioners every day. Moreover, it con- 
nects the institutes’ legacy, science and public 
functions. Ultimately, the dissolution of their 
core tasks can undermine the very survival 
of museums. Should they keep collections 
that are not used for research or outreach? 
Or focus on the type of research conducted at 
institutes that do not have collections? 

Pressed by human indifference and 
habitat destruction, harried by bureaucracy 
and regulations, marginalized in education, 
and spurned by the science and institutes 
that they founded, many naturalists and 
taxonomists now have a deep feeling of exas- 
peration. With less species sense, how fit are 
today’s biologists to write research papers, 
undertake surveys, manage collections and 
teach our children? 


REVIVE BIONOMY 
We need one name to acknowledge the 
elementary and edifying exploration of life, 
with its own funding mechanisms. Although 
venerable, ‘natural history and taxonomy’ 
can sound clunky, transient and restrictive. 
I like ‘bionomy’. Coined 150 years ago by
Ernst Haeckel as an alternative to ecology, 
bionomy has never been applied widely 
(although ‘bionomics’ is used occasionally to
describe the natural history of species).
Like the science from which it is
increasingly divorced, bionomy is a human
endeavour in its own right. Arguing that biology builds on
bionomy and thus owes it support is moot 
if science’s driving force is advance and not 
consolidation. With independent financ- 
ing, from a global pot, current indicators 
of impact — including journal citations 
and methodological innovation — can 
be replaced by appropriate ones such as 
urgency and applicability. 

Most habitats will be gone before the most 
basic surveys are done. Mapping genomes, 
running models and experiments, perhaps 
even digging fossils, may just have to wait. 
Predicting future biodiversity, or studying 
past extinctions to understand the pre- 
sent, at times seem to be almost fantastical 
excuses for ignoring the loss itself. In an era 
of extinction, there are no greater priorities 
than to accelerate the synthesis of life, sal- 
vage knowledge and increase awareness. To 
do so, we need our strongest familiarity with 
all species. 

Thanks to the technological reinvention 
of natural history and taxonomy, all imagi- 
nable knowledge can now be integrated and 
analysed⁸. Online repositories and genetic
tools released a deluge of valuable informa- 
tion, although dwindling expertise struggles 
to validate it. Citizen science is a powerful
contributing force, because no research is 
closer to the public’s heart, but it also needs 
authoritative support. We may finally learn 
the true extent and intricacy of biodiversity, 
but innovations have increased the need for 
what they seemed to replace by baring the 
enormity of the expertise gap. 

Bionomers are often criticized for lacking 
shared and achievable goals⁹. Because
enlightenment is even more important than 
information, we must invest in people before 
tools. Our target should not be a quota of species
known or access to passive data, but a
volume of active experts. If we can send a
probe to search for unlikely life in space, then 
US$10 billion of global core funding over a 
decade (roughly Turkey’s annual science 
spending) would be a bargain for humanity 
to develop a conscience for all life around us. 

The biodiversity products needed most 
today are not patents or papers, but inven- 
tories, field guides, Red Lists of threatened 
species, teaching materials and media campaigns.
Whether academic or amateur, good
bionomers have initiative and drive, and only 
need prospects and recognition to deliver 
these products. Prizes that reward their typi- 
cally lifelong personal investment may be the 
best way to double the active expertise world- 
wide. For instance, if a handbook took three 
years of funding to complete, similar funding 
will see more such output. 

Astronomers and astronauts discover the 
Universe; bionomers and bionauts uncover 
life. Just as we feel an instant sense of our 
insignificance when we stare into the cosmos, 
or experience the exhilaration of the expand- 
ing landscape as we ascend a mountain, bion- 
omy stretches our horizon. Each species is a 
world parallel to our own, invoking a sense of 
being among equals. That, I believe, is what 
Attenborough has taught us and what we 
must expand.


Klaas-Douwe B. Dijkstra is a dragonfly 
researcher affiliated with Stellenbosch 
University in South Africa and Naturalis 
Biodiversity Center in Leiden, the 
Netherlands. 

e-mail: african.dragonflies@gmail.com 
Twitter @bionomer 


1. Mora, C., Tittensor, D. P., Adl, S., Simpson, A. G. B. & Worm, B. PLoS Biol. 9, e1001127 (2011).
2. Dijkstra, K.-D. B., Mézière, N. & Kipping, J. Odonatologica 44, 447-678 (2015).
3. McCord, E. L. The Value of Species (Yale Univ. Press, 2012).
4. Dijkstra, K.-D. B., Monaghan, M. T. & Pauls, S. U. Ann. Rev. Entomol. 59, 143-163 (2014).
5. Pearson, D. L., Hamilton, A. L. & Erwin, T. L. BioScience 61, 58-63 (2011).
6. Tewksbury, J. J. et al. BioScience 64, 300-310 (2014).
7. Winker, K. & Withrow, J. J. Nature 493, 480 (2013).
8. Godfray, H. C. J. Nature 417, 17-19 (2002).
9. Wheeler, Q. Syst. Biodiv. 8, 11-15 (2010).
10. Mens, L. P., Schütte, K., Stokvis, F. R. & Dijkstra, K.-D. B. Zootaxa 4109, 153-172 (2016).


GREGORY TOBIAS/CHEMICAL HERITAGE FOUNDATION/SPL 


BOOKS & ARTS | COMMENT | 


In James Gillray’s 1802 cartoon, young Humphry Davy works the bellows at a Royal Institution lecture on pneumatics. 


A chemist’s contradictions 


Mark Peplow parses a book on Humphry Davy’s dazzling mix of personas. 


It is November 1799, and Humphry
Davy is plastered. Fuelled by alcohol and
nitrous oxide, the callow 20-year-old — 
who within a decade would be feted as the 
greatest English scientist since Isaac Newton 
— is seized by an epiphany. “Nothing exists 
but thoughts!” he cries. “The universe is 
composed of impressions, ideas, pleasures 
and pains!” 

Historian Jan Golinski recounts this 
“unprecedented binge” in The Experimental 
Self, a study that unpicks the chemist’s com- 
plexities and contradictions. It was Davy the 
experimentalist who prepared the ‘laughing
gas’ and meticulously recorded its effects.
But it was Davy the Romantic poet who gave 
vent to passionate pronouncements while in 
the grip of the gas. This amalgam of char- 
acteristics helped to establish his reputation 
as a swashbuckling scientist, a glamorous 
adventurer at the frontiers of knowledge 
and experience. 


The Experimental Self: Humphry Davy and the Making of a Man of Science
JAN GOLINSKI
University of Chicago Press: 2016.

From relatively humble beginnings
in Cornwall, Davy ascended to the presidency
of the Royal Society in London,
winning a knighthood and a baronetcy along
the way. He discovered a slew of chemical
elements, including sodium and potassium,
developed a safety lamp for miners
and helped to make chemistry the most
fashionable science of the age. He worked at
London’s Royal Institution (RI), which exploited
his fame to raise
money, and by 1808 he had used the fund- 
ing to build a monster 2,000-plate voltaic 
battery. It was the Large Hadron Collider of 
its day, a device for carving matter into its
constituent parts — and a symbol of national 
pride, beating the French in a race of voltaic 
one-upmanship. 

As a gifted scientist and charismatic 
personality, Davy has been a magnet for 
biographers for two centuries. Golinski 
takes a new approach. The Experimental 
Self does not claim to be a comprehensive 
biography. Instead, it investigates the differ- 
ent identities that Davy constructed, along 
with those ascribed to him by others: enthu- 
siast, genius, dandy, discoverer, philosopher, 
traveller. 

Golinski’s framing echoes Davy’s own 
musings. When he was about 21, Davy 
wrote a piece in his notebook entitled 
‘The History of Passion — A Philosophical
Narrative’, detailing identities that a
person might adopt throughout their life. 
He also wrote unpublished short stories 
exploring how character archetypes of
the day — such as a student or a lover
of nature — might be changed by their life 
experiences. Golinski cites these “experi- 
ments in selfhood” as evidence that Davy 
thought carefully about his character 
to create different personas, which he 
deployed strategically to build his career. 
He deliberately shaped his speech, manners 
and deportment to fit into London’s high 
society, and used his talents as a showman 
and rhetorician to attract huge audiences 
to his RI lectures on the latest chemical 
phenomena. 

Golinski thus provides a fluid view of 
Davy, hopping back and forth between dif- 
ferent periods of his life. It’s a refreshing 
approach, although it leads to some repeti- 
tion when the same periods of Davy’s life 
(and even the same incidents) are viewed 
through different lenses of personality. 

Davy the enthusiast developed during 
the nitrous oxide experiments at the Medi- 
cal Pneumatic Institution in Bristol, where 
he drew on the Romantic aesthetic of the 
sublime to describe the mingled pleasure 
and fear that the gas triggered in himself 
and willing volunteers. At the time, ‘enthu- 
siast’ meant someone who was mentally 
unbalanced by their passions; Davy used 
the word about himself to acknowledge 
how the gas distorted his perceptions. 
That work soon stirred controversy, with 
critics ridiculing the effects as a collective 
hysteria. Public shows of intoxication were 
lampooned by satirists such as the cartoon- 
ist James Gillray. 

Once he moved to the RI in 1801, 
Davy took a more moderate tone. He still 
inhaled the gas during lectures, but did not 
offer it to others. Nevertheless, the lectures 
quickly became London’s hottest ticket. 
Unlike the learned Royal Society, the institution
had a mission to diffuse knowledge
to the public and exhibit applications of
science. Davy’s dramatic demonstrations
delivered that in spades, offering spectacle
and enlightenment in equal measure.
The RI’s tiered lecture theatre was soon
crammed with toffs, including a remarkable
number of women.

Humphry Davy as a young man.

Davy the genius emerged from this period 
of public exposure. By emphasizing manner- 
isms associated with intellectual greatness 
— eloquence, intensity, expressive move- 
ments — he cemented his position as a lead- 
ing thinker, as well as a chemical showman. 
Riding a wave of success, he wrote the first 
volume of a chemistry textbook in 1812. 
But Elements of Chemical Philosophy was a 
flop. By putting himself and his experiments 
front and centre, the book appeared to be a
self-congratulatory celebration of Davy the
discoverer, and was roundly criticized by
scientists such as chemist Thomas Thomson 
for lacking the neutrality expected of a sober 
overview of the field. 

Increasingly, Davy’s carefully constructed 
personas caused problems. He became pres- 
ident of the Royal Society in 1820, but his 
charisma cut less ice there. His suggestion 
that women should be allowed to attend 
evening meetings at the society, as at the RI, 
was rejected, and even led some to question 
his masculinity. Meanwhile, a younger gen- 
eration of scientists resented his dependence 
on public display and aristocratic patronage. 
They wanted to be elected to the society on 
merit, funded by governments rather than 
lords. As the age of the professional scien- 
tist dawned, Davy started to look like an 
anachronism. 

Golinski ends with Davy the traveller. 
The scientist made a series of continental 
journeys, conducting research on anything 
from the chemistry of volcanoes to fresco 
pigments as he toured the cities of Europe. 
This was the full Davy roadshow, a heady 
blend of the scientific and Romantic: he 
would make poetic notes about the scenery 
and the weather, and faithfully record their 
effects on his health. Davy had a series of 
strokes during these trips, and died in 1829; 
his final book, Consolations in Travel, was 
published the next year. It features dialogues 
between fictional characters — including 
two reflecting aspects of himself — that 
explore philosophical questions such as 
the nature of the soul. “Consolations could 
be seen as his last virtuoso performance,” 
writes Golinski, “the last of his experiments 
in selfhood.”


Mark Peplow is a science journalist based 
in Cambridge, UK. 
e-mail: peplowscience@gmail.com 
England’s green and well-known land


Stuart Pimm extols Richard Fortey’s scientific and historical portrait of a beechwood. 


Fly into London’s Heathrow airport,
and off to the northwest you will
spot a sprinkle of dark-green patches
along the undulating fields and hedgerows.
The Chilterns are hills sitting on a
chalk escarpment, and retain some woodland
because of their soils, topography
and underlying geology, and the special
management that these demand. In 2011,
palaeontologist and natural historian
Richard Fortey bought 2 hectares of
beechwood and bluebells here and began
a diary — a “biography”, as he puts it.

In The Wood for the Trees, Fortey’s story
unfolds over two interlaced time scales: 
one a calendar year, the other two mil- 
lennia of recorded history. His year starts 
in April, witness to one of those intense 
English springs that 
can follow a long, 
damp winter. Like 
Robert Browning's 
famous paean to 
the season (1845’s
‘Home-Thoughts, from Abroad’), which
began, “Oh, to be in England/Now that
April’s there”, Fortey’s text is stiff with the
names of trees, flowers and birds. There’s 
a pattern here: a very British predilection 
for natural history. This means that British 
flora and fauna are exceptionally well 
documented. I took this for granted as a 
young naturalist in Derbyshire, only to get 
a rude shock when looking for field guides 
in other countries. A well-illustrated guide 
to the native trees of South Florida (where
I write this) was not published until 2014.

ROB FRANCIS

It’s easier to know the names of species 
in island ecosystems, of course. They have 
fewer species than those of larger land- 
masses; many continental flora and fauna 
are excluded by what William Shakespeare 
called Britain’s “moat defensive”. There are 
(arguably) around 30 native British trees, 
for example. In any case, the passion 
for naming everything seems uniquely 
British. 

As the year unfolds, Fortey names famil- 
iar birds, trees and flowers, butterflies 
— speckled wood 
(Pararge aegeria) 
and peacock (Aglais 
io) — dormice and 
squirrels. He also 
identifies moths, land 
snails, spiders, slime 
moulds, flies, beetles, 
mosses, fungi, lichens 
— and three species 
of pseudoscorpion.
Globally, taxonomists
have described only small fractions of the
species in these taxa. Fortey’s wood is home 
to more than 300 species of fungi, including 
the familiar stinkhorn fungus, graced with 
the unforgettable scientific name of Phallus 
impudicus. 

He notes his sources for identification. 
For those of us who still struggle with tools 
such as dichotomous keys, there are pic- 
ture books, which we can scan for an image 
of the unknown species. William Keble 
Martin’s 1965 The Concise British Flora 
in Colour (Ebury) is the landmark. The 
product of decades of painstaking illus- 
tration, ignored by presses worried about 
the cost of colour reproduction, it became 
an instant best-seller. In one photo, we see 
Fortey perusing illustrations to help him 
to identify a moth at a light trap. He asks 
colleagues from London’s Natural History 
Museum, where he worked as a palaeon- 
tologist, to visit. He brings in experts on 
flies and canopy insects, and others to lis- 
ten to the ultrasound of resident bats. 

The history of Fortey’s beechwood 
begins in Roman times and runs through 
the Anglo-Saxon, Viking and Norman 
invasions, providing an exceptionally 
detailed record. This is an intensively 
managed landscape for which the names 
and personal histories of centuries’ worth 
of owners are known. The wood survives 
because it was a working wood. English 
yew trees provided the bows for archers at 
the Battle of Agincourt in 1415. Ash made 
tool handles; cherry, furniture. The wood- 
workers had names, too: bodgers, turners 
and the sawpit operators’ top dogs and 
underdogs. 

A yet longer history is the geological 
record. Over the 200 kilometres from 
Derbyshire to London lie 250 million
years of rocks, from the lower Carboniferous
period, 300 million years ago, to the
Eocene epoch, around 50 million years
ago. They sit gently on top of each other,
in a perfect sequence, with hills such as
the Chilterns marking out major transitions.
Without this elegant simplicity and
a deep understanding of the natural history
that it shapes, would William Smith
(see Nature 520, 294; 2015) have produced
the first geological map?

The view across the Assendon Valley to Richard Fortey’s piece of woodland in England’s Chiltern Hills.

The Wood for the Trees: The Long View of Nature from a Small Wood
RICHARD FORTEY
William Collins: 2016.

An intermediate history is how climate
is disrupting
nature. From the Victorian era onwards,
legions of amateur enthusiasts have done 
work on species distribution that provides 
vital benchmarks. This ability to name so 
many species has meant that Britain has 
produced the world’s largest share of stud- 
ies of how plants flower earlier now than 
in the past and how butterflies live further 
north — feeding on different plants as they 
change habitats. And that is the true value 
of all these lists. As Fortey puts it: “Think 
of it as not so much an inventory as a catalogue
leading to compelling and interlocking
stories.”


Stuart Pimm is professor of conservation 
at the Nicholas School of the Environment 
at Duke University in Durham, North 
Carolina, and directs the non-profit 
organization SavingSpecies. 

e-mail: stuartpimm@me.com 
On the heredity trail


Matthew Cobb assesses Siddhartha Mukherjee’s history of human genetics. 


In 2011, Siddhartha Mukherjee won a
Pulitzer prize for The Emperor of All Maladies
(Scribner, 2010), which intertwined
science and his own experience as an oncologist.
In The Gene, Mukherjee uses a personal
approach to describe our understanding of
heredity. Despite its subtitle (‘An Intimate
History’), the historical sections of The
Gene, ranging from 1860 to the present, are
not intended to show the convoluted route to
current knowledge. They are primarily a tool
for explaining the basics of medical genetics.

As a consequence, the complexities of the
past are ironed out. Discovery is presented
not as a messy reality full of dead ends, but as
a linear thread leading inexorably to today.
Conclusions of past experiments are presented
in terms of modern understanding,
rather than as a way to explore confused
contemporaneous interpretations. This is a road
often followed by scientists and clinicians
who write history; it irritates historians, who
know that the past was more complicated.

The first half of the book takes us up to the
late 1960s and presents familiar, sometimes
erroneous versions of past milestones. For
example, nineteenth-century genetics pioneer
Gregor Mendel appears as a lone genius. In
fact, his work was part of a long-term interest
in heredity on the part of Cyrill Napp, abbot
of the monastery in Brno — now in the Czech
Republic — where Mendel was a monk. This
interest was prompted by the desire of local
agriculturalists (including some at the monastery)
to improve their animals and plants, and
began nearly 20 years before Mendel planted
his peas. Sometimes, the drama is downplayed.
A brief footnote describes Vernon
Ingram and Francis Crick’s ground-breaking
1950s demonstration that the difference
between normal and sickle-cell haemoglobin
is caused by a single-base difference in
the relevant gene. And one of the most exciting
moments in genetics history — the 1960s
discovery of how genes encode proteins — is
passed over in a couple of lines.

DNA analyses promise to change health care.

The writing comes alive in the book’s
second half, covering the 1970s onwards, and
introduced by Enlightenment poet Alexander
Pope’s line: “The proper study of mankind is
man”. Mukherjee, as a physician, rightly takes
that declaration as his own. Here, the book
does become intimate. Mukherjee’s account
of the development of biotechnology companies
in the 1970s is enriched by personal
recollections from Nobel-prizewinning
biochemist Paul Berg, in whose laboratory
Mukherjee worked in the 1990s. The passages
that describe patients with genetic diseases are 
full of the compassion that we would all wish 
from our doctors. At other points, Mukher- 
jee brings in examples from his own family, 
in particular his uncle and cousin, who both 
had schizophrenia, to frame the narrative and 
form the starting point for his examination of 
the role of genetic factors in disease. 
Mukherjee’s account of how the genetic 
basis of Huntington's disease was discovered 
is particularly effective, covering the personal 
tragedies and the motivation of the scien- 
tists who made this breakthrough, includ- 
ing Nancy Wexler, whose mother died of the 
disease. There is an atmospheric description 
of Wexler’s fieldwork region in Venezuela, 
where almost 10% of the population has 
Huntington's disease. An endnote candidly 
admits that these passages were inspired not 
by a visit to the region
as I had imagined, but
by a BBC Newsnight
report about Wexler
that can be seen on
YouTube. Some writers
are not so honest
about pulling aside the
curtain of creation.

The Gene: An Intimate History
SIDDHARTHA MUKHERJEE
Scribner: 2016.

However, one consequence
of Mukherjee’s medical focus is
that some of the most
stunning discoveries
from human genomics barely get a mention.
Homo sapiens’ interbreeding with Neander- 
thals is allotted two sentences, and our spe- 
cies’ sexual encounters with the mysterious 
Denisovans are not mentioned. The genetic 
legacy of those interactions may help to 
explain aspects of epidemiology: there seem 
to be links between Neanderthal DNA and 
a number of immunological, dermatological 
and psychiatric conditions, such as depres- 
sion and the skin lesions called actinic kera- 
tosis. Furthermore, because the book centres 
on medical genetics, anyone expecting an 
exploration of the state of genetics as a whole 
will be disappointed. Our Genes would have 
been a more appropriate title than The Gene. 

A final section examines what Berg 
described to Mukherjee as “the future of 
the future” — the amazing possibilities for 
manipulating the human genome that are 
within our grasp. Mukherjee outlines the 
rise and fall of gene therapy in the 1990s, 
always with a clinician’s compassion for the 
tragic stories behind the technology; and 
discusses the potential for gene modification 
with tools such as CRISPR-Cas9. This sec- 
tion concludes with some of what Mukher- 
jee does best, combining stories of real 
patients with the ethical dilemmas raised by 
their conditions — in this case, what would 
happen if their disorders were the subject of 
prenatal or pre-implantation testing? 

The Gene finishes with a manifesto for 
living in a post-genomic world. Mukherjee
provides a 13-point ‘opening salvo’ that
outlines our knowledge of human genetics 
and its implications. He concludes blandly: 
“we need new biological, cultural, and social 
precepts to determine which genetic inter- 
ventions may be permitted or constrained, 
and the circumstances in which these 
interventions become safe or permissible”. 
More-consistent grappling with the ethical, 
philosophical and historical debates that 
have swirled around these issues for dec- 
ades might have led to a more detailed hint 
of what those precepts might be. This vague- 
ness is frustrating: Mukherjee’s visceral and 
thought-provoking descriptions of the hor- 
rors of early-twentieth-century US eugen- 
ics clearly show what he is capable of in this 
regard, both as a writer and as a thinker.


Matthew Cobb is professor of zoology at 
the University of Manchester, UK. His latest 
book is Life’s Greatest Secret. 

e-mail: cobb@manchester.ac.uk 
Reminder to deposit
DNA sequences 


As members of the Advisory 
Committee to the International 
Nucleotide Sequence Database 
Collaboration (INSDC), which 
includes the DNA Data Bank
of Japan (DDBJ), European
Nucleotide Archive (ENA) and 
GenBank databases, we wish to 
remind the research community 
of the importance of depositing 
complete DNA-sequence data in 
these databases on publication 
of their results (see also
S. L. Salzberg et al. Science
http://dx.doi.org/10.1126/ 
science.aaf7672; 2016). Indeed, 
most journals demand a database 
accession number as a condition 
of publication. 

Access to the INSDC’s 
databases is free and unrestricted 
(G. Cochrane et al. Nucleic 
Acids Res. 44 (D1), D48-D50; 
2016), enabling researchers 
to plan experiments and to 
analyse existing data. As original 
contributions, deposited data 
form part of the scientific record 
and are citable in the literature. 
Authors can also correct and 
update their data: these amended 
records may be removed from 
the next database release, but still 
remain permanently available by 
accession number. 

The INSDC has also created 
major new repositories for 
large data collections, notably 
the Sequence Read Archive 
at the National Center for 
Biotechnology Information 
(NCBI), the DDBJ Sequence 
Read Archive and the ENA 
at the European Molecular 
Biology Laboratory's European 
Bioinformatics Institute (EMBL- 
EBI). These repositories archive 
raw data from sequencing 
experiments, a crucial facility for 
reproducibility and reuse. 

For papers dependent on 
sequence data from human 
subjects, unrestricted data 
release may not be possible.
In these cases, we would
encourage journal editors to 
insist on data sharing through 
other repositories that are not 


part of the INSDC, such as the 
NCBI's Database of Genotypes 
and Phenotypes, EMBL-EBI’s 
European Genome-phenome 
Archive or DDBJ’s Japanese 
Genotype-phenotype Archive. 
Steven L. Salzberg* Johns Hopkins 
University, Baltimore, USA. 
salzberg@jhu.edu 

*On behalf of 9 correspondents (see 
go.nature.com/tvilbd for full list). 


El Niño dons winter
disguise as La Niña


Seasonal weather forecasting 
relies almost exclusively 
on El Niño, the climate
phenomenon associated with 
warming in the equatorial Pacific 
Ocean. When the strongest 
El Niño on record developed
last autumn, it offered an 
opportunity to showcase decades 
of investment and advances 
in long-range forecasting (see 
Nature 529, 267-268; 2016). 
Surprisingly, however, actual 
winter weather events were the 
opposite of those predicted. 

For example, southern 
California’s winter was 
more about heatwaves and 
wildfires than deluges; Seattle 
in Washington endured the 
wettest winter on record rather 
than a worsening drought; and 
the upper Mississippi valley 
experienced flooding to an extent 
that had previously occurred 
only in summer. 

El Niño should strengthen
the side of the jet stream 
that is nearer to the Equator, 
bringing wet weather to the 
southwest United States and cool 
temperatures to the southeast. 
Instead, the side nearest to the 
pole strengthened. This brought 
weather that would be more 
expected of La Niña, the opposite
phase of the Southern Oscillation 
that results from cooling waters. 

I suggest considering this 
inaccurate El Niño forecast in
the wider context of the Arctic’s 
influence. Low Arctic sea ice 
and high Eurasian snow cover 
this autumn increased a Siberian 
high-pressure system and heat 


transport towards the North 
Pole, weakening the polar vortex 
this winter (see J. Cohen et al. 
Nature Geosci. 7, 627-637; 2014). 
The atmospheric responses to 
Arctic amplification were better 
predicted than were those to 
El Niño outside the tropics (see 
www.aer.com/winter2016). 
Judah Cohen Atmospheric 
and Environmental Research, 
Lexington, Massachusetts, USA. 
jcohen@aer.com 


Reform oversight of 
Italy’s science funds 


We wish to highlight the stark 
contrast between the hardship in 
Italy’s publicly funded research 
community (see G. Parisi Nature 
530, 33; 2016) and the reportedly 
largely unused sums of money 
allocated to a single research 
institute over the past 12 years. 

After 3 years without any 
financial provision for bottom-up 
research, Italy's government 
is providing €31 million 
(US$36 million) a year for the 
next 3 years to cover research 
in the humanities as well as 
science. Of this year’s 4,431 grant 
applications for this modest sum, 
just 300-500 will be successful. 

By contrast, the government 
plans to inject €150 million a 
year for the next 10 years into 
the Human Technopole project, 
which will focus on genomics, big 
data, ageing and nutrition. The 
recipient is the Italian Institute 
of Technology in Genoa, which 
is self-governing and so not 
publicly accountable — despite 
the large sums of public money 
involved. A recent report 
indicates that half of the institute's 
funds remained unspent in 
2010-14 (see go.nature.com/ 
vouywf; in Italian). 
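The contrast drawn in the two paragraphs above can be checked with a few lines of arithmetic. The sketch below uses only the figures quoted in the letter. The helper name `success_rate` is illustrative, and the multi-year totals assume that the stated annual sums are paid in full for the stated number of years.

```python
def success_rate(funded: int, applications: int) -> float:
    """Fraction of grant applications that receive funding."""
    return funded / applications


APPLICATIONS = 4431  # grant applications this year, as quoted in the letter

# The letter expects between 300 and 500 successful applications.
rate_low = success_rate(300, APPLICATIONS)
rate_high = success_rate(500, APPLICATIONS)

# Totals in millions of euros, assuming the annual sums are paid in full.
bottom_up_total = 31 * 3      # 31 million a year for 3 years
technopole_total = 150 * 10   # 150 million a year for 10 years

print(f"Success rate: {rate_low:.1%} to {rate_high:.1%}")
print(f"Bottom-up research total: {bottom_up_total} million euros")
print(f"Human Technopole total: {technopole_total} million euros")
print(f"Annual funding ratio: {150 / 31:.1f} to 1")
```

On these figures, roughly 7 to 11% of applications succeed, and the single project receives nearly five times the annual bottom-up budget.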

The government should 
establish an adequately funded 
agency that has transparent 
jurisdiction over the funding and 
execution of research. The agency 
would also monitor the progress 
of the Human Technopole and 
oversee its accountability. (See 
also go.nature.com/hgfpl}.) 


Ernesto Carafoli* Venetian 
Institute of Molecular Medicine, 
Padua, Italy. 

Cesare Montecucco* University 
of Padua, Italy. 
ernestocarafoli@gmail.com 
*Supported by 13 signatories (see 
go.nature.com/3xxovu for full list). 


Regulate devices for 
brain stimulation 


We are concerned that the rapid 
development and increased 
accessibility of non-invasive 
technologies with alleged 
brain-enhancing capabilities is 
allowing commercial interest to 
outpace regulatory mechanisms 
(see Nature 531, 283-284; 2016 
and Nature 531, S6-S8; 2016). 
Only limited technical 
ability is required to build a 
brain-stimulation device at 
home — or to dress it up and 
market it commercially. Even 
though electrical current can 
endanger cardiovascular and 
neural function (see go.nature. 
com/ej3kgx), there are currently 
no requirements for safety or 
efficacy testing of home-use 
devices through clinical-style 
trials. Beyond the safety of the 
devices themselves, the impact of 
regular or sustained personal use 
of brain stimulators is unknown. 
The public may not appreciate 
that companies are subject to a 
strict regulatory framework if 
their product claims to help an 
individual to achieve normal 
function (that is, a treatment), 
but not if it is sold to enhance 
function. We urge governments 
to align their regulatory 
standards for both applications. 
Olivia Carter, Jason Forte 
University of Melbourne, 
Australia. 
ocarter@unimelb.edu.au 
For News & Views online, go to 
nature.com/newsandviews 


HUMAN EMBRYOLOGY 


Implantation barrier overcome 


The early stages of human development are normally hidden within the womb, but improved techniques for culturing 
embryos from the blastocyst stage promise to make these steps easier to investigate. SEE LETTER P.251 


JANET ROSSANT 


Studying early development in human 
embryos is a challenge. Few embryos 
are available, and research is subject to 
considerable ethical and legal constraints. But 
understanding early-stage development is 
vital for improving reproductive technologies, 
enhancing stem-cell cultures for regenerative 
medicine and examining early pregnancy 
losses. In two papers, Deglincerti et al.¹ 
(page 251 of this issue) and Shahbazi et al.² 
(in Nature Cell Biology) report that human 
embryos derived from in vitro fertilization 
(IVF) can self-organize in a Petri dish, forming 
the founding cell lineages of the fetus and its 
supporting tissues. This is a first step towards 
a clearer view of the beginnings of human life. 
In mammals, including humans, a fertilized 
egg undergoes a series of cell divisions over the 
first days of development, leading to the formation 
of a structure called the blastocyst (Fig. 1). 
The first cell-lineage decisions are made at this 
stage, with a lineage called the epiblast, which 
goes on to form the entire fetus, becoming 
separated from two lineages that will produce 
non-embryonic tissues — first the trophecto- 
derm and then the primitive endoderm. The 
trophectoderm gives rise to cells that form 
most of the placenta, whereas the primitive 
endoderm forms some layers of the yolk sac, 
which is required for early fetal blood supply. 
The mechanisms that underlie blastocyst- 
stage lineage specification are well understood 
in mice, and it had been assumed that these 
pathways are evolutionarily conserved. How- 
ever, that assumption has been challenged’. 
Although many of the genes that direct line- 
age decisions in mouse embryos are expressed 
in the same lineages in humans, the timing of 
onset and the upstream pathways that regulate 
their expression differ between the species”. 
The blastocyst becomes implanted in the 
lining of the uterus just five days after fertili- 
zation in mice and seven days after in humans. 
This is a vital period, in which trophecto- 
derm-derived cells begin to interact with the 
uterus, and the embryo progresses towards 
perhaps the most crucial step in develop- 
ment — gastrulation, in which an epiblast- 
derived cell mass called the primitive streak 
gives rise to the three basic cell layers from 
which every bodily structure is derived. 

Figure 1 | Human embryo growth in vivo and in vitro. During early human embryonic development, 
cells form a structure called the blastocyst, which is comprised of three lineages: the epiblast, which 
will form the fetus, and the trophectoderm and primitive endoderm, which support embryonic growth. 
In vivo, by around 12 days post-fertilization, the blastocyst has implanted in the uterus and undergone 
the first cell-lineage decisions. The epiblast forms an amniotic cavity and a cell mass called the primitive 
streak, which will produce the body’s three major tissue layers. Cells derived from the primitive 
endoderm form a yolk sac, which is involved in early blood supply. Trophectoderm-derived cells form 
external structures. Deglincerti et al.¹ and Shahbazi et al.² cultured human blastocysts in vitro. Similar 
structures and cavities form, although their spatial relationships differ from those in vivo. In addition, 
Deglincerti and colleagues observed a previously unidentified cell type, which they dub yolk-sac 
trophectoderm, although its origin is unclear. (Cultured embryo adapted from ref. 1.) 

In mice, signals from the primitive endo- 
derm and trophectoderm initiate formation 
of the primitive streak®. But in humans, this 
period of development has been a complete 
black box. The only available information has 
come from rare cross-sections cut through 
human embryos and from non-human pri- 
mates, such as rhesus monkeys. Those stud- 
ies’”* show that there are major differences 
between primate and mouse development 
as the embryo implants in the uterus. Most 
notably, the mouse epiblast forms a cup-like 
structure, on one side of which form the primi- 
tive streak and amniotic folds (which will later 
form the fluid-filled amniotic membranes). 
By contrast, the primate epiblast first forms 
a central amniotic cavity and then flattens 
out to form a disc, from which the primitive 
streak arises at one end (Fig. 1). The spatial 
relationships between lineages therefore 
differ between species. 

The development of a strategy for culturing 
human embryos in vitro over the early post- 
implantation period could improve our 
understanding of the significance of these 
differences. Using a system developed for cul- 
turing mouse embryos’, Deglincerti et al. and 
Shahbazi et al. did just that, culturing human 
embryos derived from IVF up to a stage equiv- 
alent to 13 days post-fertilization in vivo. An 
improved culture medium and a better sub- 
strate for embryo attachment seem to have 
been the key to these advances. 

The groups report that blastocysts attach to 
the dish, the trophectoderm spreads out and 
shows signs of differentiation into specialized 
placental cell types, and the primitive endo- 
derm segregates from the epiblast. Shahbazi 
and colleagues found that a small central 


cavity develops within the epiblast, apparently 
because the lineage reorganizes into a radially 
polarized structure. This is reminiscent of 
how the amniotic cavity is thought to form in 
both human and rhesus-monkey embryos”*, 
although the absence of good molecular mark- 
ers of amniotic tissue precludes a conclusive 
identification of this structure. 

Both groups also observed a second cavity in 
the spreading primitive endoderm, which they 
equate to the yolk-sac cavity. Deglincerti et al. 
report that the cells lining this cavity express 
genes characteristic of trophectoderm-derived 
cells. The authors suggest that this is a previ- 
ously unidentified cell type, which they name 
yolk-sac trophectoderm. However, compari- 
son with anatomical descriptions’ suggests 
that these cells are more likely to be derived 
from the primitive endoderm — perhaps with 
a gene-expression profile that differs from 
that in mice. Extension of the cultures beyond 
12 days led to cavity collapse and disorganized 
development, although trophectoderm differ- 
entiation continued. 

Although these studies represent steps 
towards a firmer understanding of human 
development over the implantation period, 
there are many limitations still to overcome. 
The cultured embryos are largely flattened and 
two-dimensional, and so are clearly imperfect 
models of normal 3D embryonic develop- 
ment. In addition, unequivocally identifying 
cell types, cavities and structures in the cul- 
tures is challenging. Genome-wide expression 
analysis of these features might help to refine 
the system. 

This culture method could enable research- 
ers to probe the role of signalling molecules 
from the extraembryonic tissues in patterning 
the epiblast. By comparing these results with 
the signalling molecules detected in embry- 
onic-stem-cell cultures that mimic events of 
gastrulation’, we might better understand 
how to induce human stem cells to differ- 
entiate into cell types that have therapeutic 
potential. The development of a 3D blastocyst 
culture system, akin to the ‘organoid’ systems 
used to model more-mature tissues, could also 
be informative. If the topological relationships 
between the different cell types were more 
normal in such 3D cultures, this might enable 
gastrulation to occur in vitro. 

Currently, human embryo cultures are 
restricted, by international agreement, to 
14 days of growth or the beginning of primi- 
tive-streak formation, whichever comes first. 
If gastrulation were achievable in vitro, what 
would be the impact on this 14-day rule? 
Improved and longer cultures could provide 
important information for basic human biol- 
ogy, improving IVF success rates and the 
understanding of stem-cell differentiation. 
However, the development of such culture sys- 
tems would again raise the question of where 
to place the ethical limits on human embryo 
development in vitro. ■ SEE ALSO COMMENT P.169 


Janet Rossant is in the Peter Gilgan Centre 
for Research and Learning, The Hospital for 
Sick Children, Toronto, Ontario M5G 0A4, 
Canada, and in the Department of Molecular 
Genetics, University of Toronto. 
Precision pruning 
of molecules 


If organic molecules were trees, then the numerous carbon-hydrogen bonds 
within them would be leaves. A catalyst that targets one ‘leaf’ out of many similar 
other ones looks set to be a huge leap for synthetic chemistry. SEE LETTER P.230 


KIN S. YANG & KEARY M. ENGLE 


For much of the past century, organic 
chemists have focused on the reactions 
of functional groups: arrangements 
of atoms that have a characteristic, predictable 
reactivity profile. Such reactions have 
proved to be powerful tools for assembling 
structurally complex molecules for use in 
applications ranging from human medicines 
to pesticides, but they sometimes introduce 
inefficiencies that increase waste and costs. To 
expedite synthetic endeavours, attention has 
thus turned to a new generation of reactions, 
known as C-H functionalization reactions, 
that convert previously neglected carbon-hydrogen 
(C-H) bonds into useful reactive 
functional groups. A key question is how to 
precisely target a single C-H bond for reaction 
among so many others. On page 230 of 
this issue, Liao et al.¹ report an outstanding 
advance that addresses this problem. 
Carbon-carbon (C-C) and C-H bonds 
are the most common connecting units in 
organic molecules; if a molecule is viewed as 
a tree, then C-C bonds link together to make 
up the trunk and branches, which C-H bonds 
adorn as leaves. C-H bonds are so widespread 
that organic chemists almost always omit them 
when drawing chemical structures, instead 
showing only the skeleton of carbon atoms and 
heteroatoms (those that are neither carbon nor 
hydrogen). The fact that C-H bonds are not 
typically drawn also speaks to another, more 
subtle truth: they do not usually react in classic 
organic transformations. 

Figure 1 | A selective C-H functionalization reaction. Liao et al.¹ report a catalyst that can differentiate 
between similar carbon-hydrogen (C-H) bonds in hydrocarbons, such as pentane, so that the C-H bond 
at the second carbon atom reacts selectively with a diazo compound (blue), a C2 functionalization 
reaction. This occurs with more than 95% selectivity, even though many other similarly reactive 
C-H bonds (not shown) are present in the molecule. Small amounts of products from C1 and C3 
functionalization are also formed. Ar is a 4-substituted phenyl group or a 6-chloropyridin-3-yl group 
(see Fig. 4 of the paper¹ for molecular structures); R is CH₃ or CX₃CH₂, where X is fluorine, bromine or 
chlorine; Ar’ is 3,5-di(4-t-butylphenyl)phenyl; Rh, rhodium; Ph, phenyl. 

This lack of reactivity is because the bonds 
are strong, are non-acidic and do not have 
inherent affinities for other chemical entities”. 
Collectively, these issues pose a difficulty: C-H 
bonds will seemingly be cleaved only by ener- 
getic reagents or under ‘harsh’ conditions (such 
as high temperatures), but such reactions are 
unlikely to be selective for a single C-H bond. 
For example, the combustion of hydrocarbons, 
a powerful process used to produce heat since 
the rise of human civilization, non-selectively 
cleaves all C-H and C-C bonds in the starting 
material to produce carbon dioxide and water. 
Reactions that achieve controlled and selective 
functionalization of C-H bonds are akin to 
carefully pruning a single leaf from a tree and 
grafting on a new branch at that position. Such 
transformations are underdeveloped, yet hold 
the promise of permanently changing how 
chemists design and synthesize molecules. 

More than 50 years ago, inorganic and 
organometallic chemists discovered that transi- 
tion metals interact with C-H bonds in unique 
ways, enabling ready C-H functionalization””. 
More recently, organic chemists have become 
interested in this reactivity with an eye to 
developing selective reactions of C-H bonds”. 
Existing strategies for obtaining such selective 
reactions generally involve an approach called 
substrate control, which depends on the intrin- 
sic structural features and reactivity patterns of 
molecules. It remains a tremendous challenge 
to develop catalyst-controlled reactions that 
distinguish the subtle steric and electronic dif- 
ferences between C-H bonds in molecules that 
lack any functional groups (steric effects are 
those associated with the spatial crowding of 
chemical groups and atoms). Liao et al. tackled 
this issue directly by investigating the selective 
functionalization of pentane, a simple molecule 
that contains only C-C and C-H bonds. 

Pentane contains a chain of five carbon 
atoms and, because of molecular symmetry, it 
has three distinct positions at which reactions 
can occur: the chain ends; the carbon atoms 
next to the ends (also known as the C2 posi- 
tions); and the middle carbon atom. The 
authors sought to design a catalyst that would 
promote a highly selective reaction at a C-H 
bond at C2, a formidable undertaking (Fig. 1). 
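The symmetry argument above can be made concrete by counting. The sketch below is illustrative only: the function names are hypothetical, and it assumes a linear, unbranched alkane with at least two carbon atoms. It groups the carbon atoms into symmetry-distinct positions and counts the C-H bonds at each. If every C-H bond reacted equally often, C2 functionalization would account for only about a third of the product.

```python
from collections import Counter


def distinct_positions(n_carbons: int) -> dict[int, int]:
    """Map each symmetry-distinct position of a linear alkane to the
    number of carbon atoms that share it (position i mirrors n + 1 - i)."""
    counts = Counter(min(i, n_carbons + 1 - i) for i in range(1, n_carbons + 1))
    return dict(sorted(counts.items()))


def hydrogens_per_position(n_carbons: int) -> dict[int, int]:
    """Count C-H bonds at each distinct position.

    Assumes n_carbons >= 2, so chain ends are CH3 groups and all
    other carbons are CH2 groups.
    """
    result = {}
    for position, n_equivalent in distinct_positions(n_carbons).items():
        h_per_carbon = 3 if position == 1 else 2
        result[position] = n_equivalent * h_per_carbon
    return result


def statistical_share(n_carbons: int, position: int) -> float:
    """Share of product expected at a position if all C-H bonds were equally reactive."""
    hydrogens = hydrogens_per_position(n_carbons)
    return hydrogens[position] / sum(hydrogens.values())


print(distinct_positions(5))       # pentane: three distinct positions
print(hydrogens_per_position(5))   # C-H bonds at C1, C2 and C3
print(f"{statistical_share(5, 2):.0%} statistical share for C2")
```

For pentane this gives 6, 4 and 2 C-H bonds at C1, C2 and C3, so a purely statistical reaction would put about 33% of the product at C2. The reported selectivity of more than 95% is therefore far above what chance alone would deliver.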

The same research group had previously 
pioneered a strategy for taming the reactivity 
of dirhodium carbenoids, a class of organo- 
metallic complex, by attaching electron-donor 
and electron-acceptor substituents to them®”. 
The resulting species are still highly reactive, 
but are long-lived enough to participate in 
reactions with other molecules and to selec- 
tively target activated C-H bonds (such as 
those next to a C-C double bond, a benzene 


184 | NATURE | VOL 533 | 12 MAY 2016 


ring or an oxygen atom). Because pentane con- 
tains only unactivated C-H bonds, a strategy is 
needed to enable these catalysts to distinguish 
between the slight differences in steric and 
electronic properties associated with the pos- 
sible reaction sites. This means that the size, 
shape and electronic environment around the 
reactive metal centres in the complexes need 
to be finely tuned. 

The authors therefore tested a number of 
catalysts for reactivity and selectivity in the 
C-H activation of pentane. Synthesizing a 
library of new catalysts for screening is often a 
bottleneck in reaction optimization. Catalysts 
tend to consist of organic ligand molecules 
bound to metals, and so, for each catalyst, 
chemists generally synthesize the ligand first 
and add the metal in a subsequent step. 

To expedite this typically tedious process, 
Liao et al. used an ingenious approach. They 
first synthesized a versatile dirhodium precur- 
sor complex, from which new catalysts could 
be prepared at a single stroke — streamlining 
catalyst discovery in a process that parallels 
methods used for discovering small-molecule 
drugs. In this way, the authors discovered a 
catalyst that preferentially functionalizes a 
C-H bond at the second carbon of pentane 
rather than C-H bonds at the first or third 
carbons, forming a new C-C bond at that car- 
bon with more than 95% selectivity. This is a 
truly remarkable accomplishment, given the 
similarity between the three C-H bonds. 

The authors went on to show that their 
reaction is similarly effective with other satu- 
rated hydrocarbons, and with some simple 
compounds that contain potentially reactive 
functional groups such as halides, silanes and 
esters. Moreover, the transformation is highly 
enantioselective (it yields only one of the two 
possible mirror-image isomers of the product). 
Enantioselective reactions are crucial for the 
synthesis of many biologically active com- 
pounds, including pharmaceuticals. 

Although it remains to be seen how 
generally useful the optimal catalyst for C2 
functionalization in pentane will be as a tool 
for synthesis, the authors’ platform for cata- 
lyst screening and optimization will potentially 
allow catalyst structures for any given substrate 
to be tailored as needed. More broadly, Liao 
and colleagues’ research represents an impor- 
tant step towards achieving high selectivity 
in catalytic C-H functionalization, even in 
the most challenging contexts. Reactions of 
this type would allow organic molecules to 
be groomed, pruned and shaped with bonsai- 
like precision, readying them for an array of 
potential applications. ■ 
Ancient air caught 
by shooting stars 


Ashes of ancient meteors recovered from a 2.7-billion-year-old lake bed imply 
that the upper atmosphere was rich in oxygen at a time when all other evidence 
implies that the atmosphere was oxygen-free. SEE LETTER P.235 


KEVIN ZAHNLE & ROGER BUICK 


It is a truth almost¹ universally acknowledged 
that Earth’s atmosphere before 
about 2.5 billion years ago had little or no 
free oxygen. The classic argument for anoxia on 
ancient Earth is that a distinct change occurred 
in the oxidation state of many surface rocks and 
minerals around the end of the Archaean eon” 
(which lasted from 4 billion to 2.5 billion years 
ago). A more recent argument is that a sudden, 
permanent change in the relative abundances 
of rare sulfur isotopes preserved in sediments 
also occurred at that time — a change that can 
be linked to differences in sulfur’s atmospheric 
chemistry in the presence or absence of oxygen. 
These arguments are strong. It therefore 
comes as a surprise that melted meteor frag- 
ments recovered from Archaean limestone 
indicate that the contemporaneous atmosphere 
above 75 kilometres was highly oxidized, as 
reported by Tomkins et al.” on page 235. 

Figure 1 | Possible mechanisms of oxygen enrichment in the Archaean atmosphere. Tomkins et al.’ 
conclude that micrometeorites found in sedimentary limestone deposited 2.7 billion years ago were 
oxidized by their passage through an oxygen-rich upper atmosphere at a time when the lower atmosphere 
was anoxic. a, Oxygen could have formed in the upper atmosphere if carbon dioxide was split by solar 
ultraviolet radiation to form oxygen and carbon monoxide. However, the CO would have to have been 
removed to enrich the upper atmosphere in oxygen. b, Water vapour (most of which is likely to have 
condensed in the lower atmosphere) could have split to form OH radicals and hydrogen atoms, so that the 
OH could react with CO to regenerate CO₂. The hydrogen would have escaped to space, leaving oxygen 
behind. c, Alternatively, if sulfur dioxide from volcanoes was split by solar UV to form oxygen and sulfur 
particles, then the particles could have fallen to Earth’s surface, leaving oxygen behind. 

The authors recovered 60 micrometeorites 
from 2.7-billion-year-old limestone 
in the Pilbara region of Western Australia. 
Micrometeorites are the surviving bits of 
meteors that were too small to burn up as 
shooting stars in the atmosphere, and are typi- 
cally tens of micrometres in diameter. All but 
one of the Pilbara micrometeorites were origi- 
nally sand-sized grains of iron and nickel alloy. 
Of the 11 studied in detail, 9 are composed of an 
oxidized mineral called magnetite (Fe₃O₄) and 
retain a distinctive morphology that indicates 
fast cooling. The other two retain some of the 
original metal and wüstite (FeO), an iron oxide 
that occurs in meteorite fusion crusts. Nearly 
as remarkable as their existence is that there is 
nothing otherwise remarkable about them — 
they look like, and are as oxidized as, the iron 
micrometeorites that fall to Earth today’. 
Tomkins et al. argue that the micrometeorites 
look modern because the air above 75 km was 
roughly as oxidized during the Archaean as it is 
today, and back this up using a model of meteor 
physics and chemistry tuned for modern Earth. 
The model shows that the air in that region 
needed to be oxygen-rich for it to oxidize all 
the iron to magnetite as it slowed the meteor’s 
flight. If, however, the chemical reaction con- 
tinued for a little longer than was modelled, it 
might have sufficed for the air to be less oxic. 
Carbon dioxide could have acted as an alterna- 
tive oxidant (see Extended Data Fig. 5 of the 
paper’), although the kinetics for oxidation 
with CO₂ are less favourable than with oxygen. 
The idea that an oxygen-rich upper 
atmosphere sat on top of an anoxic lower atmos- 
phere poses a serious challenge to atmospheric 
modellers. Models predict® that oxygen can be 
abundant at extremely high altitudes in otherwise 
anoxic atmospheres, but because the oxygen 
comes from CO₂ that was split by sunlight 
(photolysis), it is balanced by a stoichiometric 


complement of carbon monoxide (Fig. 1). 
Overall, the resulting gas mixture would be no 
more oxidizing than CO₂ itself. 

To create a local superabundance of 
oxygen from CO₂, the CO must be preferentially 
removed. There is no obvious way to do 
this. Some other molecule is therefore required 
that can be split into oxygen and a chemically 
reduced species that can be easily removed. 
One possible candidate is sulfur dioxide (SO₂) 
from volcanoes. This gas can be split by sun- 
light into oxygen and elemental sulfur, which 
can condense to form particles that fall to 
Earth, leaving oxygen behind. Isotopes in sedi- 
mentary rock indicate that elemental sulfur did 
fall from the skies during the Archaean’, which 
makes SO₂ an attractive candidate. 

The other obvious candidate is water 
vapour, which can be split by sunlight to free 
hydrogen atoms that escape to space, leaving 
oxygen behind. Water can do double duty 
here: the hydroxyl (OH) radicals generated 
by water photolysis react with CO to put CO₂ 
back together again, freeing more hydrogen 
atoms. If the hydrogen atoms escape to space 
quickly enough, then the top of the atmosphere 
can become rich in oxygen. This is why Mars 
has more oxygen than CO in its atmosphere. 
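The three routes to an oxygen-rich upper atmosphere discussed above can be summarized as reaction schemes. The block below is a schematic bookkeeping sketch rather than the mechanism used in the paper: the individual steps follow the description in the text, and the net reactions are worked out here only to show where the oxygen and the reduced partner end up. It assumes the `amsmath` package.

```latex
\begin{align*}
\text{(a) CO}_2\text{ photolysis:}\quad
  & 2\,\mathrm{CO_2} + h\nu \rightarrow 2\,\mathrm{CO} + \mathrm{O_2}
  && \text{(CO stays, so no net oxidizing power is gained)} \\[4pt]
\text{(b) water photolysis:}\quad
  & \mathrm{H_2O} + h\nu \rightarrow \mathrm{OH} + \mathrm{H} \\
  & \mathrm{CO} + \mathrm{OH} \rightarrow \mathrm{CO_2} + \mathrm{H} \\
\text{net of (a) and (b):}\quad
  & 2\,\mathrm{H_2O} \rightarrow \mathrm{O_2} + 4\,\mathrm{H}
  && \text{(H escapes to space)} \\[4pt]
\text{(c) SO}_2\text{ photolysis, net:}\quad
  & \mathrm{SO_2} + h\nu \rightarrow \mathrm{S} + \mathrm{O_2}
  && \text{(S condenses and falls to the surface)}
\end{align*}
```

In route (a) alone, every O₂ molecule is matched by two CO molecules, which is the stoichiometric balance described in the text. Routes (b) and (c) break that balance because the reduced product (hydrogen or sulfur) physically leaves the upper atmosphere.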

But today, water is cold-trapped in Earth's 
lower atmosphere and the upper atmosphere is 
very dry. For water vapour to have reached the 
upper atmosphere during the Archaean, either 
the cold trap must have been warmer than it is 
today, or the atmosphere must have been thin- 
ner, which would have made the cold trap less 
effective’. Evidence’ that atmospheric pressure 
was less than half of what it is today — and 
perhaps much less — has been found in rocks 
of almost identical age to those hosting the oxi- 
dized micrometeorites. The micrometeorites 
might, therefore, be evidence of a thinner 
atmosphere. 

The Pilbara micrometeorites are not the 
only preserved probes of the Archaean atmos- 
phere. Several thick beds of impact-generated 
spherules — rounded bodies formed from the 
molten ejecta of meteorite impacts — provide 
hints about the atmosphere they fell through”. 
The Archaean spherule beds are analogous to 
the thinner spherule layers broadcast world- 
wide by the Chicxulub impact that killed 
the dinosaurs. One distinction between the 
Chicxulub and the Archaean spherules is that 
the latter were formed under markedly more 
reducing conditions, which has been inter- 
preted to mean that the Archaean atmosphere 
contained no more than 0.01% oxygen". The 
spherules probably last reacted with air when 
they re-entered the atmosphere as meteors — 
typically reaching a height of between 30 and 
50 km, judging from the spherules’ size (about 
a millimetre in diameter). In other words, the 
air last sampled by the spherules would have 
been between the air at Earth’s surface and that 
sampled by the micrometeorites. 

It is remarkable that objects as small as 
the micrometeorites survived intact 
for 2.7 billion years. The survival of wüstite 
is particularly unusual — this mineral is not 
normally seen near Earth’s surface — and is 
crucial to the authors’ interpretation of these 
minuscule objects as being extraterrestrial. 
However, the micrometeorites were deposited 
in a highly unusual environment: the Tumbiana 
Formation. 

This rock formation was once a system of 
lakes, and the lake in which the micrometeor- 
ites were found was highly alkaline", as indi- 
cated by its extremely high abundance of heavy 
nitrogen isotopes’. Wüstite has low solubility 
under such pH conditions”’, and would have 
been especially insoluble if the bottom waters of 
the lake and the pore waters in the buried sedi- 
ments were anoxic. Such conditions are rarely 
encountered in the geological record, which 
means that the Pilbara micrometeorites might 
be a one-off discovery. This would be unfortunate, 
because the structure, pressure and 
vertical composition of the ancient atmosphere 
are fiendishly difficult things to determine. But 
one can wish upon a shooting star. ■ 
Quasiparticles on a 
collision course 


Emergent quanta of momentum and charge, called quasiparticles, govern many 
of the properties of materials. The development of a quasiparticle collider promises 
to reveal fundamental insights into these peculiar entities. SEE LETTER P.225 


DIRK VAN DER MAREL 


Electrons and atomic nuclei in solids are 
held together by their electric charges. 

If an electron is moved to a new posi- 
tion inside a metal, then the other electrons 
and nuclei respond by shifting their own posi- 
tions. An electron that is accompanied by this 
response of the surrounding electrons and 
nuclei is an example of a quasiparticle (Fig. 1). 
It would be fascinating to prepare and manipu- 
late the trajectories of quasiparticles to make 
them collide and then to study the effect of 
the collision, similar to experiments in a par- 
ticle accelerator. On page 225 of this issue, 
Langer et al.¹ report the realization of such an 
experiment. 

The authors studied an electrical insulator, 
tungsten diselenide (WSe₂). They generated 
pairs of quasiparticles in the material, 
one negatively charged and the other posi- 
tively charged, using an ultrashort light pulse 
(10-100 femtoseconds in duration; 1 fs is 
10⁻¹⁵ seconds). The light pulse’s energy, intensity 
and duration were precisely adjusted so 
that the initial distance of the quasiparticles 
from each other, and their relative speeds, were 
well defined. 

Langer and colleagues then launched the 
quasiparticles along a linear track. This track 
was created with the help of the electric field 
from a second light pulse; the field strength, 
duration and oscillation period of the light 
pulse were adjusted to direct the quasiparticles 
into a head-on collision. The collision caused 
mutual annihilation of the quasiparticles and 
the emission of a photon, which the authors 
detected. The experiment is therefore similar 
to studies of electron-positron annihilation in 
high-energy particle accelerators (positrons 
are the antiparticles of electrons, which means 
that they have opposite charge and equal mass 
to an electron). 

The researchers can tune the conditions of 
their system in many ways by adjusting the 
aforementioned experimental parameters 
and the time interval between the generation 
of the pulses and their detection. They particu- 
larly examined the effect of the electrical 
(Coulomb) interaction between the two oppo- 
sitely charged quasiparticles. Under stable 
conditions, this interaction would bind the 
quasiparticles into a neutral composite particle 
called an exciton. Excitons are another exam- 
ple of a collective state that can exist in solids, 
somewhat like positronium atoms (which 
form from one electron and one positron). 
The authors obtained material-specific information 
such as the exciton binding energy, and 
observed an enhancement of the collisional 
cross-section (a quantity that governs the rate 
at which the quasiparticles collide) as a result 
of the Coulomb force between the oppositely 
charged quasiparticles. 

Figure 1 | An emergent quasiparticle. When an 
electron moves to a new position inside a metal, 
the other electrons and atomic nuclei shift their 
own positions in response (empty circles show 
original positions of the surrounding electrons 
and nuclei; electrons are negatively charged, nuclei 
are positively charged). The combination of the 
electron and the movement of the surrounding 
electrons and nuclei is an example of a quasiparticle, 
a collective phenomenon that behaves like a 
single particle. Langer et al.¹ describe a system that 
allows quasiparticle collisions to be studied. 

The beauty of Langer and colleagues’ experi- 
mental toolkit is that it might finally allow 
quasiparticles and their mutual interactions 
to be studied in the materials in which they 
arise. The negative and positive quasiparti- 
cles in the authors’ experiment are similar to 
electrons and positrons in a vacuum, but a 
rich variety of unconventional quasiparticles 
could also be studied, for which no equivalent 
elementary particles are known. For example, 
when an electron is introduced into an insulating
transition-metal oxide such as strontium
titanate (SrTiO₃), the electron slightly
attracts the positive ions in the material (Ti⁴⁺
and Sr²⁺), but slightly repels the negative oxide
ions (O²⁻). When the electron moves around
the compound's lattice, the ionic displacements 
move with the electron. The resulting object 
— the electron plus the co-moving lattice dis- 
tortion — is called a polaron””. Its properties 
and behaviour are different from those of an 
electron; for example, its mass is typically two 
or three times higher. 

Quasiparticles that are even more bizarre 
emerge in two-dimensional gases of interact- 
ing electrons in a strong magnetic field. The 
charge on these quasiparticles is a fraction of 
that for an electron: it can be one-third (the 
same as for elementary particles called quarks), 
one-fifth, one-seventh, or smaller*”. 

When a magnetic field is applied to certain 
superconductors, peculiar topological states 
known as vortices appear, equivalent to tubes 
of magnetic flux. Vortices and antivortices 
form spontaneously” in 2D superconductors, 
but it might also be possible to generate them 
using light pulses. This would open the way to 
studies of their interactions using Langer and 
colleagues’ approach, including the annihilation
of vortex–antivortex pairs.

Quasiparticles are not only of academic 
interest — they also determine many of the 
properties and functionalities of materials, 
such as electrical resistivity, heat capacity and 
magnetism. There are thus many reasons to 
study quasiparticles in the materials in which 
they are manifested. Langer and co-workers 
have provided a fresh strategy with which 
condensed-matter physicists can tackle such 
studies. This promises fundamental insights,
but also offers ways to control and handle the
quasiparticles characteristic of the various 
states of matter that can be realized in solids. 
That said, only a few experimental facili- 
ties will have the combination of technolo- 
gies required to study quasiparticles that have 
fractional charges, or vortex–antivortex annihilation
in two dimensions. But for those that
do, Langer and colleagues’ approach can be 
readily applied to investigate the properties of 
polarons in strontium titanate or other tran- 
sition-metal oxides, or the ‘heavy electrons’ 
that occur in several materials owing to the 
coupling of mobile electrons to fluctuations of
magnetic polarization* '°. According to some
schools of thought, the quasiparticle concept
does not apply in certain materials or under
special conditions’’. Collision experiments
might therefore help to identify the boundaries
of the quasiparticle concept.

CELESTIAL MECHANICS

Fresh solutions to the
four-body problem


Describing the motion of three or more bodies under the influence of gravity 
is one of the toughest problems in astronomy. The report of solutions to a large 
subclass of the four-body problem is truly remarkable. 


DOUGLAS P. HAMILTON 


The study of the orbital motions of bodies
that are subject to their mutual gravitational
attractions is crucial for understanding
the movements of moons, planets
and stars, and for navigating spacecraft to 
distant planets. The central problem is to 
determine the motions of n point masses 
interacting through gravitational forces 
that vary with the inverse square of their 
separation distances. This n-body prob- 
lem is famous among astronomers and 
mathematicians, and is known to have 
no general analytical solution (that is, 
no solution that can be written down in 
terms of simple mathematical functions). 
Nevertheless, specific solutions have been 
eagerly sought and occasionally discov- 
ered. Writing in Celestial Mechanics and 
Dynamical Astronomy, Érdi and Czirják¹
report analytical solutions for a broad class 
of four-body configurations. 

Isaac Newton solved the two-body 
problem in his 1687 masterwork, the 
Principia, but the three-body problem 
proved surprisingly complex and occupied 
many distinguished mathematicians over 
the next two centuries. Leonhard Euler and 
Joseph-Louis Lagrange found all analytical 
solutions to an important subclass of the 
three-body problem known as central con- 
figurations, but work by Heinrich Bruns 
and by Henri Poincaré in the late 1880s 
showed that a general arrangement of three
or more bodies admits no analytical solution.
Although the set of all possible central configurations
of four bodies remains unknown,
Érdi and Czirják have taken a large stride forward
by solving all of those in which two of the
bodies lie along an axis of symmetry.

Figure 1 | A subclass of the three-body problem. The
motions of three bodies with masses $m_1$, $m_2$ and $m_3$ under the
influence of gravitational forces can be described analytically
(that is, in terms of simple mathematical functions) for
the special case in which the bodies are placed at the vertices
of an equilateral triangle. Proof of this involves considering
the accelerations in the y direction of two masses placed on
the x axis. It emerges that the bodies orbit in such a way that
the triangle rotates, expands or shrinks, and always remains
in the xy plane. The yellow star represents the centre of mass
of the three-body system. Distances between the masses are
represented by the symbol r, with subscripts representing the
masses; coordinates for masses and for the centres of mass
are given as (x,y) pairs.

In central configurations, each body must 
be subject to an acceleration directed towards 
the centre of mass of the system with a magni- 
tude that is proportional to its distance from 
the centre of mass. All orbits of two bodies are 
central configurations, in which the objects 
each orbit their common centre of mass 
along ellipses that have identical shapes and 
orbital periods. Euler’s solutions for linear 
arrangements of three bodies are also central 
configurations, as are Lagrange’s solutions 
in which the three masses are placed at the 
vertices of an equilateral triangle. In the lat- 
ter system, Lagrange showed that the vertices 
of the triangle can move in such a way as to 
preserve relative distances between the masses; 
the triangle can rotate around the centre of 
mass, expand or shrink, but must remain in 
its initial plane. 

It is easy to show that equilateral trian- 
gles are the only possible planar central 
configuration of a three-body system by 
placing two of the masses ($m_1$ and $m_2$)
on an x axis, and considering their accelerations
in the perpendicular y direction
(Fig. 1). The y acceleration on $m_1$ is due
solely to the gravity of the third mass ($m_3$)
and equals $G m_3 y_3 / r_{13}^3$, where G is the
gravitational constant, $y_3$ is the coordinate
of $m_3$ along the y axis and $r_{13}$ is the distance
between $m_1$ and $m_3$. The corresponding y
acceleration on $m_2$ is $G m_3 y_3 / r_{23}^3$. According
to the definition of central configurations,
these accelerations must separately equal
$\lambda y_{\rm cm}$ (where $y_{\rm cm}$ is the y coordinate of the
centre of mass and $\lambda$ is the common proportionality
constant). By cancelling like
terms in the two y accelerations, it immediately
becomes apparent that $r_{13}$ must be
the same as $r_{23}$.
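
The argument can also be checked numerically. The sketch below is an illustration, not part of the published work: it computes the Newtonian accelerations for a set of point masses and measures how far they depart from the central-configuration condition (each acceleration equal to the same constant times the displacement towards the centre of mass). The function names, the choice G = 1 and the test masses are all assumptions made for the example.

```python
import math

def accelerations(masses, positions, G=1.0):
    """Newtonian acceleration of each point mass due to all the others."""
    result = []
    for i, ri in enumerate(positions):
        a = [0.0] * len(ri)
        for j, rj in enumerate(positions):
            if i == j:
                continue
            inv_r3 = 1.0 / math.dist(ri, rj) ** 3
            for k in range(len(ri)):
                a[k] += G * masses[j] * (rj[k] - ri[k]) * inv_r3
        result.append(a)
    return result

def central_residual(masses, positions):
    """Largest violation of a_i = -lam * (r_i - r_cm) for the best-fit lam."""
    total = sum(masses)
    dim = len(positions[0])
    cm = [sum(m * r[k] for m, r in zip(masses, positions)) / total
          for k in range(dim)]
    rel = [[r[k] - cm[k] for k in range(dim)] for r in positions]
    acc = accelerations(masses, positions)
    # Least-squares estimate of the single proportionality constant lam.
    num = -sum(a[k] * s[k] for a, s in zip(acc, rel) for k in range(dim))
    den = sum(s[k] ** 2 for s in rel for k in range(dim))
    lam = num / den
    return max(abs(a[k] + lam * s[k])
               for a, s in zip(acc, rel) for k in range(dim))

masses = [1.0, 2.0, 3.0]  # arbitrary and deliberately unequal
equilateral = [(0.0, 0.0), (1.0, 0.0), (0.5, math.sqrt(3) / 2)]
isosceles = [(0.0, 0.0), (1.0, 0.0), (0.5, 1.0)]

print(central_residual(masses, equilateral))  # rounding error only
print(central_residual(masses, isosceles))    # clearly non-zero
```

The equilateral triangle passes for any choice of masses, whereas stretching it into an isosceles triangle breaks the condition. Because the functions work in any number of dimensions, the same check can be applied to four masses at the vertices of a regular tetrahedron.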

If the symmetry of the equilateral-triangle
system is then exploited by choosing a
new x axis to run along the line connecting
$m_1$ and $m_3$, repeating the above argument
shows that $r_{12}$ must also be the same as $r_{23}$,
and thus all three sides of the triangle must
be equal in length. This proof extrapolates
directly to four bodies: the only fully three-dimensional
central configurations for
four bodies are those in which the masses are
placed on the vertices of a tetrahedron such
that all distances between the masses are equal.

Figure 2 | A subclass of the four-body problem. Érdi and Czirják¹ solved
a subclass of the four-body problem derived from systems of three masses
initially located on the x axis. a–c, In each case, two masses ($m_1$ and $m_2$) are
left on the x axis; the third mass (original position shown with open circle) is
split into two equal halves, each of mass m, which are moved symmetrically
in the y direction (dashed arrows). The centre of mass of each system is at

But two-dimensional central solutions of 
four bodies are much more difficult to identify.
In their seminal work, Érdi and Czirják
find three examples of such solutions, which 
can be visualized by considering a system of 
three masses distributed on a line. Each solu- 
tion is found by splitting one of the masses into 
equal halves and moving the fragments up and 
down so that the resulting distribution of four 
masses is symmetric about the x axis (Fig. 2). 
The four-sided polygon formed by connect- 
ing the on-axis masses to the off-axis masses is 
convex when the central mass is split (Fig. 2a), 
and concave when one of the other masses is 
split (Fig. 2b,c). Érdi and Czirják’s two concave
cases differ according to whether the centre of 
mass of the system excluding m, is enclosed by 
the polygon or not. 

The next step would normally be to specify 
the masses and then to seek all arrangements 
of those masses that satisfy the conditions for a 
central configuration. Érdi and Czirják, however,
chose to tackle the inverse problem: given
the positions of the bodies, they computed the 
masses that make the configuration central. 
And, rather than working with rectangular 
coordinates for the two off-axis masses, the 
authors chose to recast the problem in terms 
of a pair of angles that fix the position of those 
masses relative to the ones on the x axis (Fig. 2). 
These are both inspired choices that make the 
problem analytically tractable. If one or more of 
the four masses is set to zero, the angles take on 
values that are consistent with straight lines and 
equilateral triangles; in this way the four-body 
edifice of Érdi and Czirják’s work is rooted in
the three-body bedrock of Euler and Lagrange. 
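
One way to see why fixing the positions helps is sketched below. This is a standard observation about the central-configuration equations and is not necessarily the route taken by Érdi and Czirják; the symbols $\mathbf{r}_i$ (positions), $r_{ij}$ (separations), $\mathbf{c}$ (centre) and $\lambda$ follow the definition given earlier, and treating $\lambda$ as a chosen scale is an assumption of the sketch.

```latex
% Central-configuration condition for body i: the acceleration points
% towards the centre c and is proportional to the distance from it.
\begin{align}
\sum_{j \neq i} \frac{G\, m_j\,(\mathbf{r}_j - \mathbf{r}_i)}{r_{ij}^{3}}
  &= -\lambda\,(\mathbf{r}_i - \mathbf{c}), \qquad i = 1, \dots, n, \\
% Multiply by m_i and sum over i: the pairwise terms cancel by antisymmetry.
\lambda \sum_{i} m_i\,(\mathbf{r}_i - \mathbf{c}) &= 0
  \quad\Longrightarrow\quad
  \mathbf{c} = \frac{\sum_i m_i \mathbf{r}_i}{\sum_i m_i}.
\end{align}
```

With the positions given and $\lambda$ fixed, the first line is linear in the unknown masses $m_j$ and in $\mathbf{c}$, and the second line shows that any solution with non-zero $\lambda$ and non-zero total mass automatically places $\mathbf{c}$ at the centre of mass. For four bodies in a plane this amounts to eight scalar equations in six unknowns, so only special sets of positions admit a solution, which is consistent with the need to restrict attention to a symmetric subclass.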

Central configurations are dynamic 
equilibria that can be stable (such as a ball at 
the bottom of a smooth bowl) or unstable (as 
for a ball perched atop a round hill). Euler’s 
straight-line configurations are all unstable 
so that, like the ball on the hill, the configura- 
tion cannot persist when tweaked. Lagrange’s 
equilateral-triangle solutions are stable if
one of the three masses contains more than
about 96% of the total mass of the system, but 
unstable if the mass is more evenly distrib- 
uted. Thus, Lagrangian configurations for a 
system that incorporates the Sun, Jupiter and 
a suitably placed asteroid are stable, as would 
be those for Earth, the Moon and a modestly 
sized future space station. By contrast, Pluto's 
massive moon Charon prevents any central 
configurations involving these bodies and 
a smaller moon from being stable. Whether 
the new four-body central configurations are 
stable is an interesting, unexplored question 
and is an inviting direction for future research. 
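
The figure of about 96% can be illustrated with a classical linear-stability condition for the circular Lagrange triangle, usually attributed to Gascheau and Routh. The article does not state the condition, so its use here is an assumption, as are the rounded masses and the hypothetical small bodies in the sketch below.

```python
import math

def lagrange_triangle_is_linearly_stable(m1, m2, m3):
    """Gascheau's condition: 27 (m1 m2 + m2 m3 + m3 m1) < (m1 + m2 + m3)^2."""
    total = m1 + m2 + m3
    return 27.0 * (m1 * m2 + m2 * m3 + m3 * m1) < total ** 2

def dominant_mass_fraction_threshold():
    """Smallest stable primary mass fraction when the third body is massless."""
    # Solve mu * (1 - mu) = 1/27 for the smaller root mu, then return 1 - mu.
    mu = 0.5 * (1.0 - math.sqrt(1.0 - 4.0 / 27.0))
    return 1.0 - mu

# Approximate masses in kilograms (rounded values, assumed for illustration).
SUN, JUPITER = 1.989e30, 1.898e27
EARTH, MOON = 5.972e24, 7.35e22
PLUTO, CHARON = 1.30e22, 1.59e21
ASTEROID = 1.0e15    # hypothetical small asteroid
STATION = 4.0e5      # hypothetical space station
SMALL_MOON = 1.0e16  # hypothetical small moon

print(dominant_mass_fraction_threshold())  # about 0.96
print(lagrange_triangle_is_linearly_stable(SUN, JUPITER, ASTEROID))
print(lagrange_triangle_is_linearly_stable(EARTH, MOON, STATION))
print(lagrange_triangle_is_linearly_stable(PLUTO, CHARON, SMALL_MOON))
```

Under these assumptions the Sun–Jupiter and Earth–Moon cases satisfy the condition and the Pluto–Charon case does not, in line with the examples in the text. The condition concerns small perturbations of circular motion only; it says nothing about the new four-body configurations.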

Érdi and Czirják’s solution to a large
subclass of the central four-body problem is a 
major advance that encompasses and greatly 
extends many previous four-body results, 
including: arrangements of four’ and three” 
identical masses; kite-shaped configurations 
of diagonally opposite pairs of equal masses’; 
and the limiting case of three bodies plus 
a massless test particle’. Just as three-body 
configurations serve as limiting cases for Érdi
and Czirják’s four-body configurations, the
authors’ solutions could, in turn, be used as
limiting cases for ambitious future extensions
of the n-body problem: perhaps three masses
along a line plus two symmetrically placed
equal masses; a test particle plus planar configurations
of the type considered in the present
work; or even planar arrangements of four
different masses.

Figure 2 (continued) | (0,0) and the masses form convex (a) or concave (b and c) polygons when
connected. The centre of mass excluding m, is shown with a blue star in
b and c; its location inside or outside the polygon distinguishes the
two cases. The positions of $m_1$ and $m_2$ were defined using rectangular
coordinates, whereas those of the two off-axis masses were fixed by the

NEUROBIOLOGY

Wired for sex


Analysis of a sensory neural circuit in the roundworm Caenorhabditis elegans
reveals that its wiring is sex-specific, and arises through the elimination of 
connections that are originally formed in both sexes. SEE ARTICLE P.206 


DOUGLAS S. PORTMAN 


If presented with a human brain, even the
most meticulous neuroanatomist would
be hard-pressed to identify the sex of its
former owner. There are clearly male-female
differences in some brain regions, but these
can be subtle and variable, and their causes
and consequences remain largely unclear. Over
the past five years, work in several organisms’
has suggested that altered neural connectivity
between brain regions might be a hallmark
of male-female differences. On page 206 of 
this issue, Oren-Suissa et al.* provide clear 
evidence for sex differences in neural wiring 
in the roundworm Caenorhabditis elegans. 
Moreover, they report that these differences 
arise through sex-specific eradication of neural 
connections and are controlled by the genetic 
sex of the nervous system itself. 

In C. elegans, males are males, but females 
are properly called hermaphrodites. They
produce a reserve of sperm as juveniles and
can reproduce either by self-fertilization or 
by mating with a male. The nervous systems 
of both sexes have been studied in extra- 
ordinary detail. An adult hermaphrodite has 
precisely 302 neurons, and the complete set 
of connections between these neurons — the 
connectome — was mapped’ in the 1980s. 
These connections, called synapses, wire neu- 
rons up into circuits and are crucial for brain 
function. The complete hermaphrodite con- 
nectome remains the only one of its kind, and 
has provided unparalleled information about 
neural circuits and their control of behaviour. 

Oren-Suissa et al. began by comparing the 
hermaphrodite’s wiring diagram with the con- 
nectome of the adult male’s tail®, which was 
determined in 2012. The authors focused on a 
circuit of interconnected neurons that receives 
and interprets input from a tiny sensory organ 
in the tail called the phasmid. Unexpectedly, 
they identified seven synapses present in one 
sex but not the other, a phenomenon known 
as sexual dimorphism. For example, although 
two neurons called PHB and AVA are adjacent 
in both sexes, a synaptic connection between 
them seemed to exist only in hermaphrodites. 
In males, PHB connected instead to the nearby 
neuron AVG (Fig. 1). 

The investigators used genetic tools to 
fluorescently label these seven synapses, allow- 
ing the connections to be visualized in live 
animals. After confirming that the connections 
really were sexually dimorphic, Oren-Suissa 
and colleagues asked how sex influenced 
synaptic patterning. A key insight came 
from examining sexually immature animals. 
At five of the synapses, juvenile animals had 
both male- and hermaphrodite-specific con- 
nections, indicating that both sexes initially 
adopt a sexually intermediate state. Sexual 
dimorphism arises through the subsequent 
elimination of specific connections — a pro- 
cess called synaptic pruning. 

Although synaptic pruning is not well 
described in C. elegans, it is essential in the 
development of the mammalian nervous sys- 
tem, enabling changes in connectivity that are 
thought to underlie learning, memory and 
cognitive maturation. Cells called microglia, 
which specialize in synaptic pruning, have 
been implicated in the generation of sex differ- 
ences in the mammalian brain’. Thus, pruning 
could have an evolutionarily conserved role 
in shaping sex-specific differences in circuit 
connectivity. Furthermore, disrupted synap- 
tic pruning has been suggested to contribute 
to neuropsychiatric disorders that show sex 
biases, such as autism® and schizophrenia’. Per- 
haps differences in the pruning of particular 
connections can make one sex — in these 
cases, males — more susceptible to disease. 

Figure 1 | Lost in maturation. Oren-Suissa et al.’ identify seven synaptic connections between neurons
in the roundworm Caenorhabditis elegans that are sex-specific, including those between neurons called
PHB and AVA, and those between PHB and AVG. In juvenile animals, these synapses are found in
both sexes (male and hermaphrodite). As worms mature, synapses are eliminated in a manner that
depends on the genetic sex of each neuron. In males, the connection between PHB and AVA is lost. This
connection is maintained in hermaphrodites, whereas that between PHB and AVG is lost. Genetically
switching the sex of PHB is sufficient to reverse this pattern of connections.

Oren-Suissa et al. investigated the underlying
mechanisms that link sex to the pattern
of pruning. In C. elegans, an individual's sex is
determined by the number of X chromosomes
it carries. Reversing this genetic signal in the
nervous system — or even in individual neu- 
rons — has been shown to alter circuit physiol- 
ogy and behaviour”’. Using this approach, the 
researchers found that the genetic sex of indi- 
vidual neurons can also determine whether 
a particular synapse is pruned. For example, 
genetically ‘masculinizing’ the PHB neuron of 
a juvenile hermaphrodite usually led to elimi- 
nation of the PHB-AVA connection. More- 
over, the PHB-AVG synapse, which is typically 
pruned in hermaphrodites, was retained 
(Fig. 1). Thus, the stability of particular 
synapses is linked to genetic sex. 

At first glance, this role for genetic sex might 
seem inconsequential for mammals, in which 
information about sexual state is broadcast 
throughout the body by hormones — such 
as testosterone or oestrogen — released by 
the gonads. However, sexual differentiation 
of the mammalian brain also depends on its 
own genetic sex’’. The details of the differences 
caused by this effect remain largely mysteri- 
ous, and studies in invertebrates may provide 
insight into its workings. 

Oren-Suissa et al. next considered the 
consequences of sex-specific pruning on 
behaviour. In hermaphrodites, pruning gener- 
ates a phasmid circuit that regulates avoidance 
of noxious chemicals. In males, however, prun- 
ing links the phasmid to neurons that signal 
the presence of mates. The authors therefore 
wondered whether the phasmid circuit might 
be repurposed for male sexual behaviour. 
Although they did not test the behaviour of 
males that had a feminized phasmid circuit, 
they did find that disrupting the phasmid 
by disabling the PHB neuron compromised 
males’ ability to mate. 

This study leaves some questions 
unexamined, and raises fascinating new ones. 
It is easy to understand why juvenile males 
benefit from avoiding noxious chemicals, 
providing a rationale for hermaphrodite-like
connections in an immature male. But it is less 
obvious why young hermaphrodites would 
build male-like connections, only to prune
them later. It also remains unclear how a given
neuron ‘knows’ which of its many synapses to 
prune. In mammals, pruning is often a conse- 
quence of decreased synaptic activity; perhaps 
this phenomenon plays a part in C. elegans, too. 
Finally, does genetic sex also prune connections 
in the worm’s head, which harbours circuits 
that control more-complex decision-making? 
And what might the consequences of this be? 
Oren-Suissa and colleagues’ study provides 
key support for the emerging idea that biologi- 
cal sex modulates the structure and function 
of neural circuits. In more-complex animals, 
sex differences in pruning might modulate 
functional connectivity, disease susceptibil- 
ity and cognition. Further studies of natural 
sexual variation in the brain are likely to pro- 
vide important insight into the workings of 
neural circuits. They should also shed light on 
the interactions between social and biologi- 
cal processes, and could perhaps even tell us 
something about what makes us human.

Palaeoclimatic insights into forcing and
response of monsoon rainfall 


Mahyar Mohtadi¹, Matthias Prange¹ & Stephan Steinke¹


Monsoons are the dominant seasonal mode of climate variability in the tropics and are critically important conveyors 
of atmospheric moisture and energy at a global scale. Predicting monsoons, which have profound impacts on regions 
that are collectively home to more than 70 per cent of Earth’s population, is a challenge that is difficult to overcome by 
relying on instrumental data from only the past few decades. Palaeoclimatic evidence of monsoon rainfall dynamics 
across different regions and timescales could help us to understand and predict the sensitivity and response of monsoons 
to various forcing mechanisms. This evidence suggests that monsoon systems exhibit substantial regional character. 


Reliable prediction of summer monsoons is critical to mitigating
the often catastrophic consequences of monsoon rainfall anomalies,
such as floods and droughts, famine, and economic losses.
Yet, despite its pivotal role in the livelihood of billions of people, summer 
monsoon rainfall remains difficult to predict!~. It is expected that total 
monsoon precipitation in the Northern and Southern hemispheres will 
change in opposite directions in the coming decades, owing to differ- 
ences in hemispheric warming*. Regional differences are expected to be 
large, and some monsoon regions are predicted to receive more rain- 
fall despite a projected weakening of the monsoon circulation. Even 
though the latest generation of climate models show substantial overall 
improvements compared to their predecessors”, the extent and inten- 
sity of monsoons are still often under-simulated’, with most models 
showing systematic errors in the seasonal cycle and in intra-seasonal-to- 
interannual variability>°. Moreover, there is low confidence in projections 
of future changes in the amount of rainfall for several monsoon domains". 
For instance, projection of the South Asian (Indian) summer monsoon 
rainfall is a key challenge for global and regional circulation models’. 
Poor simulations of monsoon dynamics arise mainly from the complex- 
ity of these land-ocean-atmosphere coupled systems that interact with 
nearly all other tropical and extratropical climate phenomena, such as 
the El Niño–Southern Oscillation (ENSO)? and the Hadley and Walker
circulations® (Box 1). 

The monsoon concept has recently changed from a regional to a global 
one. The new idea of a “global monsoon”®” takes into account a coherent 
response of all monsoon systems, regardless of regional differences, to 
changes in global-scale atmospheric circulation patterns forced by the 
annual cycle of solar radiation and land-air-sea interactions®. Modern 
climatology of monsoon domains (Fig. 1) is well depicted by the concept 
of a global monsoon’, and models are substantially better at represent- 
ing the global monsoon than the regional monsoons". However, several 
reconstructions and transient simulations of past monsoon variability 
emphasize that it has a substantial regional character, rather than being 
characterized by the common, global dynamics of the monsoon systems at 
different timescales!®''. Study of monsoon evolution beyond the instru- 
mental record is thus essential to improve our understanding of natural 
and anthropogenic forcing mechanisms of monsoon rainfall and its inter- 
actions and teleconnections on various timescales, which will enable us to 
test and enhance climate models in their ability to represent and predict 
monsoon dynamics, and to contribute to the climate change discussion. 

Within the past decade, palaeoclimate studies have provided new 
insight into past monsoon variability with unprecedented temporal and
spatial coverage. Here we evaluate the forcing mechanisms of variability in
summer monsoon rainfall as evidenced by palaeoclimate research (Fig. 2), 
and identify several aspects of monsoon dynamics that will improve our 
understanding of future monsoon response to specific forcing. The scope 
of this Review is constrained by the available monsoon reconstructions 
and the timeliness of climate model simulations, and excludes oceanic 
monsoon domains, highly uncertain forcing mechanisms and tectonic 
forcing. 

The palaeoclimatic evidence shows that monsoon dynamics are 
strongly shaped by large-scale meridional temperature gradients and the 
related position of the intertropical convergence zone!” (ITCZ; Box 1). 
However, study of past monsoons also reveals that these temperature gra- 
dients are sensitive to many types of forcing, the influence of which seems 
to vary in time and space. Until climate models converge on the simu- 
lation of monsoon forcings, a robust projection of future monsoons will 
remain elusive!?. In this Review, we discuss the main forcings of monsoon 
variability and their uncertainties, and argue that a coordinated effort to 
quantify past variations in meridional temperature gradients, including 
site-specific monsoon reconstructions, will provide a crucial test bed for 
model improvement. 


Orbital forcing 

Changes in the tilt of Earth’s axis (obliquity) as well as axial and apsidal 
precession (wobble of Earth’s axis and rotation of Earth’s elliptical orbit 
over time, respectively) modulate the temporal and spatial distribution of 
insolation. Precession, with periods of about 19 kyr and 23 kyr, affects the 
seasonal cycle of incoming solar radiation and its hemispheric distribu- 
tion, and is therefore considered a major control on changes in monsoon 
intensity’. A prominent example of this control was during the early-to- 
mid Holocene, when higher-than-today Northern Hemisphere summer 
insolation rendered the North African monsoon strong enough to turn 
the Sahara desert into a steppe or savannah landscape'”®. 

Generally, monsoon rainfall and seasonality (that is, the amplitude 
of the annual rainfall cycle) are enhanced in the Northern Hemisphere 
and reduced in the Southern Hemisphere during a precession minimum 
(when the Northern Hemisphere summer solstice occurs at perihelion— 
the point in the orbit of the Earth at which it is nearest to the Sun) and vice 
versa during a precession maximum (when the Northern Hemisphere 
summer solstice occurs at aphelion—the point in the orbit of the Earth 
at which it is farthest from the Sun); see Fig. 3. Substantial support for 
this view is provided by more than a decade of research on stable oxygen
isotope (δ¹⁸O) records of high-resolution and absolute-dated cave


¹MARUM – Center for Marine Environmental Sciences, University of Bremen, 28359 Bremen, Germany.


12 MAY 2016 | VOL 533 | NATURE | 191 


© 2016 Macmillan Publishers Limited. All rights reserved 


REVIEW 


BOX 1
Key characteristics of monsoon 
systems and related climate 
phenomena 


Intertropical convergence zone (ITCZ). The ITCZ is a tropical belt of 
maximum precipitation that results from deep convection that migrates 
seasonally towards the warming hemisphere?’. Over land, it moves back 
and forth across the equator following the zenith point of the Sun. The 
zonal-mean position of the ITCZ is associated with the rising branch of 
the global Hadley cell. 

Tropical and subtropical monsoons. In tropical monsoons,
precipitation occurs almost entirely within an ITCZ that is seasonally 
displaced in the presence of cross-equatorial pressure gradients. The 
onset of tropical monsoons correlates with a distinct change in the 
wind shear, such as in the South Asian monsoon2®43°9. In subtropical
monsoons, precipitation is mainly controlled by the position and 
migration of the subtropical highs and frontal systems. For the East Asian 
monsoon, the jet stream and large-scale topography are also crucial to 
the seasonal evolution of rainfall?®. 

Dynamic and thermodynamic mechanisms. Monsoon rainfall is 
affected by dynamic and thermodynamic mechanisms. Dynamic 
mechanisms refer to changes in winds with unchanged water-vapour 
concentration in the air. Thermodynamic mechanisms refer to changes 
in water-vapour concentration with unchanged winds. Thermodynamic 
mechanisms dominate the hemispherically antisymmetric, annual- 
mean precipitation response to precession in the absence of land-sea 
contrasts**. 

Atlantic Multidecadal Oscillation (AMO). The AMO is a mode of 
natural climate variability characterized by a coherent pattern of 
variability in basin-wide North Atlantic sea surface temperature (SST) 
with a period of 60-80 years, and is associated with multidecadal 
variations in the strength of the Atlantic overturning circulation!©. 

Pacific Decadal Oscillation (PDO). The PDO is the leading empirical 
orthogonal function of monthly SST anomalies over the North Pacific, 
and is closely related to the strength of the wintertime Aleutian low- 
pressure system. It is characterized by SST anomalies with a period of 
20-30 years and opposite signs in the western extratropical and eastern 
Pacific. 

El Niño–Southern Oscillation (ENSO). The ENSO is a naturally
occurring, interannual fluctuation in equatorial Pacific SST, with a warm
(El Niño) and a cool (La Niña) phase. Its atmospheric component, the
Southern Oscillation, is measured by the sea-level pressure difference
between Darwin, Australia and Tahiti, French Polynesia. The central-Pacific
El Niño is characterized by positive SST anomalies in the central,
rather than the eastern, equatorial Pacific.

Walker circulation. Walker circulation describes thermally direct, 
equatorial, zonal overturning circulation that converts available potential 
energy to kinetic energy of atmospheric motion. Over the Pacific Ocean, 
a zonal sea-level pressure gradient causes surface air to move from 
high pressure (caused by sinking motion) in the eastern Pacific to 
low pressure (caused by rising motion) in the western Pacific. Walker 
circulation is intrinsically connected to the ENSO, with a stronger flow 
during La Niña and a weaker flow during El Niño.

Hadley circulation. Hadley circulation describes tropical, meridional 
overturning circulation and is quantified by a mass-flux stream function. 
Strong diabatic heating near the thermal equator results in ascending air 
that spreads poleward, descends at subtropical high-pressure zones of 
both hemispheres, and flows towards the equator near the ocean or land 
surface. During the summer monsoon season, the Hadley circulation 
changes to a distinctly asymmetric flow, with ascent in the summer 
hemisphere and substantial cross-equatorial transport of energy and 
moisture?®°8, 


stalagmites that vary with 19-kyr or 23-kyr periodicity, similarly to pre- 
cession, with an anti-phased interhemispheric relationship'”'®. Recently, 
climatic interpretations of the stalagmite δ¹⁸O variability, particularly in 
the East Asian monsoon domain, have become controversial, with an 
overwhelming and growing number of observational and palaeoclimate 
data and model studies suggesting that the intensity of the local summer 
monsoon rainfall is not the only control on this proxy’. 

Despite this ongoing controversy surrounding the interpretation of 
stalagmite δ¹⁸O from the East Asian monsoon domain, model simulations 
support the notion that precession is a strong control on rainfall in all 
monsoon domains (Fig. 3). During a precession minimum, the Northern 
Hemisphere summer is characterized by higher insolation and a stronger 
land-sea thermal gradient, which increase the atmospheric humidity 
(the thermodynamic component of monsoon rainfall) and wind circula- 
tion intensity (the dynamic component) (Box 1), whereas the Southern 
Hemisphere summer experiences the opposite scenario”. Simulations 
of changes in the spatial distribution of rainfall due to precession do not 
support a simple meridional shift in the seasonal position of the ITCZ, 
but do indicate a shift in the rainfall between land and ocean (Fig. 3a, b). 
Here, higher insolation causes the early-summer surface temperature 
to increase much faster over land than it does over the ocean, so that 
the maximum near-surface equivalent potential temperature (a quantity 
related to the stability of a column of air: if the equivalent potential tem- 
perature at the surface is greater than aloft, then the column is unstable 
and convection can occur) is shifted over land before the onset of the sum- 
mer monsoon, resulting in enhanced land precipitation at the expense of 
rainfall over the adjacent sea”®. By contrast, the maximum near-surface 
equivalent potential temperature and the precipitation centroid tend to 
stay over the ocean when insolation is low”®. 

Changes in obliquity with a period of 41 kyr affect the seasonality of 
incoming solar radiation equally in both hemispheres, with stronger var- 
iation in incoming radiation at high latitudes. Despite weak changes in 
incoming solar radiation at low latitudes, palaeoclimate data and model 
studies suggest that obliquity has a substantial effect on the strength of 
monsoon systems!!!°5.26, with increased summer monsoon rainfall when 
obliquity is maximal (Fig. 3c, d). A key factor in the traditional view of the 
(indirect) effect of obliquity on monsoons is high-latitude remote climate 
forcing associated with Northern Hemisphere ice sheets and sea-ice’. 
Obliquity-induced changes in cryosphere extent have been suggested to 
affect the monsoon domains by changing the oceanic and atmospheric cir- 
culations and the trajectories of moisture advection on glacial-interglacial 
timescales. Recent modelling results suggest that these changes are small 
for the South Asian monsoon, which becomes drier during glacial periods 
owing to lower temperatures”? (a thermodynamic mechanism). However, 
for the East Asian monsoon domain, circulation changes are critically 
large and result in moisture convergence from the Pacific Ocean and 
increased precipitation, partly due to redistribution of air mass from 
the continents to the oceans as ice sheets grow, thereby enhancing the 
subtropical high-pressure system over the Pacific Ocean” (Fig. 4). In 
addition, the waxing and waning of ice affect the subtropical monsoons 
(Box 1) by changing the latitudinal temperature gradient and the position 
of the jet stream—a critical component of the East Asian monsoon™””85— 
and by inducing stationary planetary waves that strengthen the East Asian 
summer monsoon when ice grows’. The release of meltwater into the 
North Atlantic by waning ice sheets suppresses the North American mon- 
soon more than any other monsoon domain”. 

Recent studies suggest that obliquity also has a direct effect on mon- 
soon rainfall by changing the meridional insolation gradient in the sum- 
mer hemisphere and the interhemispheric insolation gradient”°*". The 
direct insolation-gradient forcing by obliquity can thus greatly contrib- 
ute to changes in monsoon intensity at many locations without involv- 
ing the (indirect) high-latitude remote climate forcing associated with 
Northern Hemisphere ice sheets”>”° (Fig. 3). The presence of 41-kyr peri- 
odicity in West African monsoon records during the warm Pliocene”’, 
that is, before the onset of strong Quaternary ice-sheet feedbacks in the 
Northern Hemisphere, supports the model-derived finding of a direct 
insolation-gradient forcing. 
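
Detecting a 41-kyr periodicity in a record is, in its simplest form, a matter of finding the strongest peak in a power spectrum. The following is a minimal sketch using a plain discrete Fourier transform on a synthetic, evenly sampled series; the series, its amplitudes and the function name are hypothetical. Real proxy records are unevenly sampled and carry age uncertainty, so they need interpolation or a method designed for uneven sampling.

```python
import cmath
import math


def dominant_period(series, dt):
    """Return the period (in units of dt) of the strongest spectral peak.

    Uses a direct discrete Fourier transform of the mean-removed series,
    scanning frequencies from 1 cycle per record up to the Nyquist limit.
    """
    n = len(series)
    avg = sum(series) / n
    x = [v - avg for v in series]
    best_k, best_power = None, -1.0
    for k in range(1, n // 2 + 1):
        coeff = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        power = abs(coeff) ** 2
        if power > best_power:
            best_k, best_power = k, power
    return n * dt / best_k


# Synthetic record sampled every 1 kyr for 943 kyr (23 x 41, so that both
# cycles fit the record exactly): a 41-kyr cycle plus a weaker 23-kyr cycle.
n_samples = 943
series = [
    math.sin(2 * math.pi * t / 41) + 0.5 * math.sin(2 * math.pi * t / 23)
    for t in range(n_samples)
]
print(dominant_period(series, 1.0))  # 41.0
```

The record length is chosen so that both cycles are exact harmonics of the record, which avoids spectral leakage in this toy case.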

The timing of the response of different monsoon systems to precession 
and obliquity forcing and the degree to which they respond are matters of 
debate. Reconstructions of rainfall from the South American® and east- 
ern North African!! monsoons point to precession as the main forcing, 
whereas other reconstructions consider obliquity forcing to be at least 
of comparable importance for the West African!! and South Asian! 
monsoons. On the other hand, model studies consistently suggest that, 
when ice sheet feedbacks are disabled, precession has a more severe 
Figure 1 | Global monsoon domain (coloured regions) as defined by 
the seasonality (summer-winter difference) in rainfall. For the 
calculation of the monsoon domain, we applied the criteria of ref. 9—which 
require that the annual precipitation range (local summer-minus-winter 
precipitation) is larger than 6cm month and exceeds 50% of the 
annual-mean precipitation—except that the boreal summer was defined 
as June-August and the austral summer as December-February. 
Colours denote annual-mean precipitation. Monthly global gridded, 
high-resolution (0.5°) precipitation data from 1901-2010 from the 
University of Delaware were used. Regional monsoon areas are: North 
America (N. Am.), South America (S. Am.), North Africa (N. Afr.), 
southern Africa (S. Afr.), South Asia (S. As.) or India, East Asia (E. As.) 
and Australia—Indonesia (Aus.-Ind.). 
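
The two criteria in the caption can be sketched as a per-grid-cell test. This is a minimal illustration, not the procedure of ref. 9: the function name and sample values are hypothetical, the range threshold is left as a caller-supplied parameter, and the sketch assumes that both the seasonal range and the annual mean are expressed as mean monthly precipitation in the same unit.

```python
def is_monsoon_cell(monthly_precip, latitude, min_range):
    """Apply the two monsoon-domain criteria to one grid cell.

    monthly_precip: 12 values (January..December), same unit as min_range.
    latitude: degrees north; selects which season counts as local summer.
    min_range: minimum summer-minus-winter range (caller-supplied assumption).
    """
    if len(monthly_precip) != 12:
        raise ValueError("expected 12 monthly values")
    jja = sum(monthly_precip[5:8]) / 3.0  # June-August mean
    djf = (monthly_precip[11] + monthly_precip[0] + monthly_precip[1]) / 3.0
    # Boreal summer is June-August, austral summer is December-February.
    summer, winter = (jja, djf) if latitude >= 0 else (djf, jja)
    annual_range = summer - winter
    annual_mean = sum(monthly_precip) / 12.0
    return annual_range > min_range and annual_range > 0.5 * annual_mean


# Hypothetical Northern Hemisphere cell with a wet summer (values per month).
wet_summer = [5, 5, 10, 20, 60, 200, 300, 250, 120, 40, 10, 5]
print(is_monsoon_cell(wet_summer, 20.0, 60.0))
print(is_monsoon_cell([80.0] * 12, 20.0, 60.0))
```

A cell with rainfall spread evenly through the year fails both tests, however wet it is, which is why the domain is defined by seasonality and not by total rainfall.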
Figure 2 | Basic components of a summer monsoon and its driving 
forces. Summer solar radiation heats the surface. A land-ocean thermal 
contrast, resulting from a lower surface heat capacity of the land compared 
to the ocean, causes the low-level inflow of moist air from ocean to land 
during summer and the rising of air over the landmass. Recent studies 
of monsoon dynamics have de-emphasized the relevance of land—ocean 
surface contrasts and suggested that interactions between extratropical 
eddies and the tropical circulation are essential for the development 
of monsoons”’. Condensation of water vapour in the rising air leads 
to the release of latent heat and rainfall. The latent-heat release in the 
troposphere reinforces the circulation and helps to pull in additional 
moisture from ocean to land. Depending on the vegetation cover, 
evapotranspiration on land may further feed the monsoon rain via local 
water recycling. Evapotranspiration may also affect the dynamics of the 
monsoon system via evaporative cooling of the land surface. Aerosols 
can affect the land-sea thermal contrast and, hence, the monsoon by 
modifying the transfer of solar radiation through the atmosphere by 
scattering and absorption processes. Aerosols may also have an effect 
on cloud microphysics, including changes in the radiative properties, 
frequency and lifetime of clouds. Because the various components of 
the monsoon system are closely coupled through feedback mechanisms, 
perturbation of any component may cause a chain reaction that affects 
the monsoon system as a whole. Different forcings perturb different 
components of the monsoon system, which then affect the entire system 
via feedback mechanisms, and act on different timescales. AMO, Atlantic 
Multidecadal Oscillation; AMOC, Atlantic meridional overturning 
circulation; ENSO, El Niño-Southern Oscillation; PDO, Pacific Decadal 
Oscillation. 
Figure 3 | Effects of obliquity and precession on tropical rainfall. 
a-d, Precipitation anomalies (in millimetres per day; colour scale) as 
simulated by the high-resolution (1.125° horizontal resolution with 
62 levels in the atmosphere), fully coupled, atmosphere-ocean general 
circulation model EC-Earth?°. Pmin − Pmax (a, b) refers to the difference 
between the minimum and maximum climatic precession; Tmax − Tmin 
(c, d) refers to the difference between the maximum and minimum 
obliquity (axial tilt). Results are shown for boreal summer (June-August; 
a, c) and austral summer (December-February; b, d) mean precipitation. 
Applied minimum and maximum values for obliquity and climatic 
precession correspond to extreme values of the orbital parameters during 
the last one million years. Obliquity was set to a minimum value in a and b. 
In c and d, a circular orbit was assumed; that is, climatic precession was 
set to zero. During precession minimum (Pmin), summer solstice occurs at 
perihelion such that seasonality is enhanced in the Northern Hemisphere 
and reduced in the Southern Hemisphere. During precession maximum 
(Pmax), summer solstice occurs at aphelion. All other boundary conditions, 
such as the solar constant, greenhouse gas concentrations, sea level, ice 
sheets and vegetation, were kept fixed at pre-industrial levels during the 
model experiments. Details of the experimental set-up are described in 
ref. 25. Figure courtesy of J. H. C. Bosmans, University of Utrecht. 


impact on monsoon rainfall than does obliquity”? (Fig. 3), but with- 
out a consistent response of the specific monsoon systems to orbitally 
forced insolation changes. This is in part due to the varying influence 
of the internal feedback mechanisms such as ocean-atmosphere inter- 
action. For instance, the sign and magnitude of sea surface temperature 
(SST) feedbacks on monsoon rainfall are equivocal in model simulations, 
with some implying a very limited effect on the South American™ and 
East Asian** monsoons and others suggesting a large impact on the East 
Asian monsoon”. The influence of SST has been shown to amplify the 
insolation-induced strengthening of the African** and South Asian*° 
monsoons, and to reduce the intensification of the South Asian monsoon*. 
Mechanistically, in some models, SST increases promote inland 
advection of moist air from the tropical Atlantic into the North African 
Figure 4 | Monsoon variability at different timescales as evidenced 
by δ¹⁸O of cave stalagmites. Ensemble empirical mode decomposition 
of cave speleothem δ¹⁸O records from the South Asian (red) and East 
Asian (blue) monsoon domains. Values are relative to Vienna Pee Dee 
Belemnite (VPDB). The bottom four panels show the intrinsic components 
decomposed from the speleothem δ¹⁸O records (top panel) ordered by 
timescales, as indicated, illustrating the stalagmite δ¹⁸O variance at each 
specific timescale. Summer (July-August) insolation at 30° N is shown 
in grey in the fourth panel. The lack of glacial-interglacial variability in 
the East Asian (blue) record in the bottom panel can be explained by the 
cancelling effect of increased precipitation (decrease in δ¹⁸O) and a greater 
contribution of the Pacific moisture source (increase in δ¹⁸O). Image 
adapted from ref. 23, National Academy of Sciences. 


monsoon domain*, but shift the rainfall centroid towards the ocean in the 
South Asian monsoon domain”®”*. By contrast, other models suggest 
that more divergent upper-tropospheric flow above the Arabian Sea and 
greater atmospheric water availability act as a positive feedback on the 
South Asian monsoon rainfall*?. 

The inconsistent and, in part, out-of-phase behaviour of the monsoon 
domains in response to insolation forcing is also due to their resonant 
response to insolation changes**. Transient simulations suggest that the 
subtropical monsoon systems are less sensitive to insolation forcing than 
are the tropical monsoons in terms of total rainfall, because the insolation- 
induced changes in summer rainfall are partly counter-balanced by 
precipitation changes in other seasons!”. The sensitivity of the tropical 
monsoons is also not uniform: the North African monsoon is most sen- 
sitive to summer insolation, whereas the South Asian monsoon is most 
sensitive to spring-to-early-summer insolation*’. It has been suggested 
that the reason for this non-uniformity is a resonant response of the 
South Asian monsoon to insolation forcing when maximum insolation 
anomalies occur near the summer solstice (for example, during the early 
Holocene) and a resonant response of the African monsoon—which has 
its rainfall maximum one month later in the annual cycle than the South 
Asian monsoon—when the maximum insolation change is delayed after 
the summer solstice (for example, during the middle Holocene)*. 

In summary, it appears that the different responses of the individ- 
ual monsoons to orbital forcing arise from their differing seasonal 
cycles that are influenced by regionally distinctive internal feedbacks. 
However, orbital forcing affects all the monsoon domains by changing 
Figure 5 | Monsoon rainfall anomalies during Heinrich stadial 1. 
Changes in rainfall during HS1 (15-17 kyr before present) relative to the 
Last Glacial Maximum (LGM, 19-21 kyr before present) as indicated by 
monsoon proxy records (filled circles) and simulated by the TraCE-21k 
climate model** (annual-mean precipitation anomalies, colour shading). 
The TraCE-21k simulation was performed with the coupled atmosphere- 
ocean general circulation model CCSM3 (Community Climate System 
Model, version 3). The resolution of the atmospheric component is 3.75° 


the meridional gradient in insolation and, hence, heating; considering the 
recent differential hemispheric warming that will probably increase in the 
future’, exploring this aspect in palaeo-monsoon studies may help to bet- 
ter assess and predict future changes in monsoon rainfall. In addition, it 
has been suggested that obliquity-induced changes have some qualitative 
similarities with global-warming-induced changes, with respect to the 
summer-hemisphere Hadley cell driven by baroclinic eddies*!. Therefore, 
a deeper understanding of obliquity-induced changes in tropical circula- 
tion is crucial for more reliable predictions of climate change. 


North Atlantic forcing 

Today, a northward cross-equatorial, ocean heat transport of about 0.5 PW, 
which is mainly due to the Atlantic meridional overturning circulation 
(AMOC), results in relatively warm surface waters in the northern North 
Atlantic and renders the Northern Hemisphere warmer than the Southern 
Hemisphere. In response, there is a cross-equatorial, southward transport 
of energy via the mean Hadley circulation, as a result of the ascending 
branch of the Hadley circulation and the mean position of its ITCZ being 
located north of the Equator*”. Any physical mechanism that induces 
anomalous warming or cooling of one hemisphere relative to the other 
will cause a shift in the mean position of the ITCZ towards the warming 
hemisphere*®*?, which will affect tropical monsoon systems. 

Heinrich and Dansgaard-Oeschger events of the last glacial period 
serve as paradigms of oscillations in interhemispheric temperature asym- 
metry“. During these millennial-scale climate fluctuations, changes in 
the interhemispheric thermal gradient were probably caused by variations 
in the AMOC" and its associated heat transport, and amplified by sea-ice 
feedbacks*. As a result, the largest temperature anomalies appeared in 
the North Atlantic realm*°“”; nevertheless, millennial-scale glacial climate 
variability is well expressed in numerous palaeo-monsoon records from 
all over the globe’”'®*? (Fig. 4). 

The largest body of data exists for the last Heinrich stadial (HS1, about 
18-15 kyr before present) when the AMOC was nearly shut off and sur- 
face temperatures in the North Atlantic realm were at a minimum“. 
A compilation of available palaeo-hydrological records for HS1 suggests 
a general pattern of a global-scale, southward ITCZ shift associated with 
changing interhemispheric thermal asymmetry (Fig. 5). Specifically, sub- 
stantial drying in the North African and South Asian monsoon regions 
is found, whereas the Australian—Indonesian and South American 
monsoon systems become wetter. Speleothem isotope records from the 
East Asian monsoon system also indicate dry conditions during HS1, 
although model studies cast doubt on the local-rainfall interpretation 
of these records (see above). The pattern of HS1 precipitation response 


in the horizontal direction with 26 layers in the vertical direction. Starting 
from the LGM, the model was integrated to the present-day, subject to 
realistically varying forcings by orbital insolation, atmospheric greenhouse 
gas concentrations, continental ice sheets, and meltwater fluxes. Filled red 
and blue circles indicate negative and positive monsoon rainfall anomalies, 
respectively. Filled white circles indicate no change or uncertain changes 
during HS1 compared to the LGM. Only records from monsoon domains 
as indicated in Fig. 1 are shown. For references, see main text. 
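
The anomaly shown in Fig. 5 is a difference between two time-window means. The sketch below assumes a proxy series given as parallel lists of ages (kyr before present) and values; the function names and sample numbers are hypothetical, and the default windows follow this caption (HS1, 15-17 kyr; LGM, 19-21 kyr). Whether a negative difference means drier conditions depends on the proxy and is not decided by the code.

```python
def window_mean(ages_kyr, values, start, end):
    """Mean of all values whose age lies in [start, end] (kyr before present)."""
    selected = [v for a, v in zip(ages_kyr, values) if start <= a <= end]
    if not selected:
        raise ValueError("no samples fall inside the requested window")
    return sum(selected) / len(selected)


def hs1_minus_lgm(ages_kyr, values, hs1=(15.0, 17.0), lgm=(19.0, 21.0)):
    """HS1 window mean minus LGM window mean for one proxy record."""
    return window_mean(ages_kyr, values, *hs1) - window_mean(ages_kyr, values, *lgm)


# Hypothetical record: ages in kyr before present, proxy values in arbitrary units.
ages = [14, 15, 16, 17, 18, 19, 20, 21, 22]
proxy = [5, 2, 1, 3, 4, 6, 7, 5, 6]
print(hs1_minus_lgm(ages, proxy))  # -4.0
```

The same function applies to a model time series at a grid cell, so proxy and simulated anomalies can be computed in the same way before they are compared.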


for the southern African monsoon region appears to be more complex. 
While the northern portion of the southern African monsoon region 
becomes drier, the southern portion witnesses anomalously wet condi- 
tions during HS1 (Fig. 5). 

Owing to the global distribution of HS1 monsoon records, model- 
data comparison for this climate event offers a unique possibility for test- 
ing the reliability of climate model simulations with respect to AMOC 
slowdown. The TraCE-21k transient simulation of the last deglacia- 
tion supports the notion that AMOC intensity is a dominant control 
on tropical rainfall patterns on millennial timescales***°. A compari- 
son of the HS1 hydroclimatic records with the TraCE-21k-simulated 
HS1 precipitation response shows agreement in terms of the sign of 
the rainfall anomalies over most monsoon regions (Fig. 5). However, 
discrepancies between the model and the data stand out in southeastern 
Brazil, close to the South Atlantic convergence zone. Furthermore, 
reconstructed dry conditions in the South Asian monsoon region appear 
to be greatly underestimated by TraCE-21k. The reason for this weak 
rainfall response in the model is still uncertain, but we are beginning 
to understand the teleconnections between North Atlantic SST forcing 
and the South Asian summer monsoon. Several studies have examined 
possible teleconnections, and different mechanisms have been proposed, 
including a tropospheric temperature anomaly over Eurasia related to 
the North Atlantic Oscillation*”, a pathway over the Pacific Walker cir- 
culation*’, a southward shift of the subtropical westerly jet over Africa 
and Asia‘, and a wave-like atmospheric teleconnection between the 
northern North Atlantic and India*®. 
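
A sign-based model-data comparison of the kind described above can be sketched as follows. This is an illustration only: the site names, anomaly values and function names are hypothetical, and records coded as no change or uncertain (sign 0) are assumed to be excluded from the comparison.

```python
def sign(x, tol=0.0):
    """Return +1, -1 or 0 depending on whether x exceeds the tolerance."""
    if x > tol:
        return 1
    if x < -tol:
        return -1
    return 0


def sign_agreement(proxy_signs, model_anomalies, tol=0.0):
    """Compare proxy-inferred signs with simulated anomalies, site by site.

    proxy_signs: dict site -> -1 (drier), +1 (wetter) or 0 (no/uncertain change).
    model_anomalies: dict site -> simulated precipitation anomaly.
    Returns (fraction of compared sites that agree, list of disagreeing sites).
    """
    compared = [s for s in proxy_signs if proxy_signs[s] != 0 and s in model_anomalies]
    disagree = [s for s in compared if sign(model_anomalies[s], tol) != proxy_signs[s]]
    if not compared:
        return float("nan"), disagree
    return (len(compared) - len(disagree)) / len(compared), disagree


# Hypothetical sites and values, for illustration only.
proxy = {"site_A": -1, "site_B": 1, "site_C": -1, "site_D": 0}
model = {"site_A": -0.4, "site_B": 0.3, "site_C": 0.1, "site_D": 0.2}
fraction, mismatches = sign_agreement(proxy, model)
print(round(fraction, 2), mismatches)
```

A sign test of this kind says nothing about magnitude, so an underestimated but correctly signed response still counts as agreement.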

As well as abrupt climate events during the last glacial period and 
deglaciation, several palaeo-monsoon records also witness centennial- 
to-millennial-scale climate variability during the Holocene associ- 
ated with North Atlantic cold phases, such as the effect of the North 
Atlantic cold event that took place 8.2 kyr ago and is akin to the HS1 
rainfall anomaly pattern'”!***. Operating on a much shorter timescale, 
the Atlantic Multidecadal Oscillation (AMO; Box 1) is a mode of 
natural climate variability characterized by a coherent pattern of vari- 
ability in basin-wide North Atlantic SST with a period of 60-80 years. 
Observational data suggest that North Atlantic SST associated with the 
AMO has an influence on multidecadal global monsoon variability*', 
with the overall pattern of changes in global monsoon rainfall during 
the cold phase of the AMO resembling the HS1 anomaly pattern (Fig. 5). 
However, owing to the shortness of the instrumental record, the robust- 
ness of these links is difficult to assess. High-resolution proxy records 
enable us to extend the instrumental monsoon records to the past few 
thousand years*°?->4, 
In view of a potential slowdown of the AMOC in the coming dec- 
ades*°, understanding the connection between oceanic heat transport 
and monsoon systems is of paramount importance for predicting the 
fate of the monsoons. Meanwhile, a growing body of palaeo-monsoon 
records allows us to paint a global picture of changes in monsoon rainfall 
in response to large-scale oceanic redistribution of heat. 


CO₂ forcing 

Models predict an increase in total monsoon rainfall during the twenty- 
first century in response to rising atmospheric greenhouse gas (GHG) 
forcing, increasing atmospheric moisture content, and an expansion 
of the area affected by monsoons". Isolating the effects of atmospheric 
GHG forcing on monsoon dynamics from palaeoclimatic records is dif- 
ficult because changes in GHG concentrations and other forcings usually 
occurred contemporaneously. Not only did GHG concentrations oscillate 
in concert with Quaternary glacial—interglacial cycles, but so did the size 
of ice sheets, the area of sea-ice, and the orbital forcing. Traditionally, dry 
conditions in North Africa during Quaternary glacial stages have been 
attributed to expansion of the boreal cryosphere and North Atlantic cool- 
ing’. However, recently published results from the TraCE-21k transient 
simulation of the last glacial termination suggest that the GHG forcing 
plays an important role in developing wet interglacial conditions in the 
African monsoon regions*®. Conversely, the lack of glacial-interglacial 
variability in rainfall proxy records from deep tropical monsoon systems 
that reach beyond the last glacial termination indicates that GHG forc- 
ing plays a limited role and that the boreal cryosphere has a stronger 
influence”®. 

The Pliocene warm period with a relatively high atmospheric CO₂ 
concentration is often considered as a potential analogue of future climate. 
Palaeoclimatic evidence suggests wet Pliocene conditions in the West 
African monsoon region, including the presence of Saharan palaeo-rivers, 
while forests, woodland and savannah extended further north compared 
to the Pleistocene*’. For the East Asian monsoon region, proxy data and 
climate models suggest stronger-than-modern summer winds associ- 
ated with wetter conditions during the mid-Pliocene warm period**. To 
what extent the wetter Pliocene conditions can be attributed to direct 
GHG forcing of the monsoons remains uncertain, because other potential 
forcing factors (for example, ice sheets, topography and global ocean cir- 
culation) were also different from today. Yet additional Pliocene records 
of rainfall from other monsoon regions will help to identify the sign of 
change in a warmer-than-today Earth. Isolating the effects of atmos- 
pheric GHG forcing on monsoon dynamics remains a critical task for 
palaeo-monsoon studies, to narrow down uncertainties in projections of 
future monsoon rainfall. 


ENSO and PDO 

Similarly to the AMO (see above), instrumental records suggest that SST 
anomalies in the tropical Pacific associated with the ENSO are a predom- 
inant forcing of monsoon variability in modern climate’. ENSO-related 
SST anomalies affect the global atmospheric circulation, particularly the 
Walker circulation (Box 1). Generally, the east-west displacement of the 
ascending and descending branches of the Walker circulation during El 
Niño years results in an increased descent over Australasia and reduced 
monsoon rainfall**. However, this relationship appears unstable in the 
instrumental records, partly as a result of the different types of the ENSO, 
which are characterized by spatially varying SST anomalies (between the 
central and eastern equatorial Pacific; Box 1) and teleconnections™, and 
highlights the need for palaeoclimatic evidence beyond the instrumental 
records. 

Proxy records of monsoon rainfall in North America support a leading 
role of the ENSO and the Pacific Decadal Oscillation (PDO; Box 1) after 
the Northern Hemisphere summer and autumn insolation declined about 
4,000 years ago by changing the Pacific SST°’, the Pacific—Atlantic SST 
gradient™, or the position of the ITCZ®. Whether the observed changes 
in monsoon precipitation are caused by a combination of forcings*, the 
ENSO only®!®4, or solar forcing® remains controversial. This is in part 
due to the fact that different forcings give rise to similar responses within 
the same monsoon domain. For example, La Niña, positive AMO and 
negative PDO cause a wet Mesoamerica and dry southwest USA, whereas 
El Niño, negative AMO and positive PDO give rise to a dry Mesoamerica 
and wet southwest USA. More importantly, the suggested links to ENSO, 
PDO or AMO rely mainly on simple statistical analyses of the rainfall 
proxy data such as power spectra or correlation coefficients that are not 
sufficient to infer any causal mechanisms. 

Palaeoclimate data from the Australasian monsoon regions imply that 
the ENSO has a heterogeneous spatiotemporal impact even within the 
same monsoon domain. A 50-year stalagmite record from northeast 
India suggests that El Niño and positive PDO have diminished monsoon 
precipitation in central India and shortened the moisture transport 
pathways to northeast India during the past decades®. Tree-ring data 
of the last millennium suggest that a simple canonical form of ENSO 
influence is insufficient to explain Asian monsoon variability, probably 
owing to the different pathways through which ENSO interacts with 
the different components of the monsoon”. Secular changes in ENSO 
teleconnections further complicate a general assessment of ENSO- 
related rainfall variability and make it somewhat regionally dependent! 
Finally, the lack of indisputable and continuous benchmark records 
for the ENSO, PDO and AMO is another obstacle in untangling their 
relationship to monsoons. While many independent reconstructions 
of the AMOC, atmospheric CO₂ concentration and solar activity are 
available, data relating to the variability in the ENSO, PDO and AMO, 
in forcings and responses, and in lead and lag times on scales longer 
than a few decades do not exist. Therefore, reconstructing past changes 
in the monsoon-ENSO-PDO relationship remains one of the most 
critical contributions of palaeoclimate research to the climate change 
discussion. 


Land cover 

Land-cover changes alter surface roughness, albedo and water fluxes, 
thereby affecting the energy and moisture budgets of monsoon systems 
(Fig. 2). Over the Tibetan plateau, expansion of vegetation has been 
observed in response to recent warming, leading to enhanced evapo- 
rative cooling with potential effects on the South Asian monsoon®. On 
the other hand, deforestation in Mexico has been identified as a drought 
amplifier®, and human-induced land-cover changes in China over the 
past 3,400 years have been suggested to weaken the East Asian summer 
monsoon”. Observational evidence from the period 1981-2003 suggests 
that the decrease in the South Asian monsoon rainfall may have been 
caused by agricultural intensification’’, and model simulations indicate 
that land-use changes and deforestation may cause a locally delayed” 
and reduced monsoon rainfall in the Northern Hemisphere by shifting 
the ITCZ southward”? 

Although anthropogenic land-cover changes may act as a forcing for 
shifts in monsoons, a dynamic vegetation cover may also act as a feedback 
to changes in monsoon rainfall, especially in North Africa. Pioneering 
work by Charney” has suggested a strong positive feedback between 
vegetation cover and monsoon rainfall in the Sahel region. On the basis 
of these ideas it has been proposed that the effect of expanded North 
African vegetation cover on surface albedo would have been crucial in 
amplifying the orbitally triggered, early-to-mid Holocene, West African 
monsoon rainfall anomaly—the so-called African humid period, during 
which the Sahara was much ‘greener’ than it is today’*. Provided that 
the positive vegetation-rainfall feedback is strong enough to introduce 
nonlinear dynamics into the climate—vegetation system, two equilibria 
of the regional atmosphere-vegetation state may exist: a humid/green 
state and a dry/desert state. A transition from the humid state to the 
dry state by a catastrophic bifurcation was suggested to have abruptly 
terminated the African humid period around 5.5 kyr ago’. Other studies 
have questioned both the abruptness of the large-scale North African 
climate transition’>”* and the existence of a strong local Charney feed- 
back, which operates through changes in surface albedo””. Instead, a 
recent model study has suggested a positive vegetation-rainfall feedback 
during the Holocene that operated via anomalies in surface latent-heat 
flux caused by canopy evaporation and transpiration and their effect on 
the mid-tropospheric African easterly jet’*. Another study highlighted the 
role of remote forcing from expanded forest cover in Eurasia in amplifying 
North African rainfall during the early-to-mid Holocene”. Both feedback 
mechanisms are specific to the North African monsoon and cannot be 
generalized to the other monsoon regions*”. 

In summary, forcing and feedbacks associated with land-cover changes 
may have important effects on the different regional monsoon systems. 
Even though quantification of monsoon-vegetation feedbacks by means 
of proxy data alone will be difficult, reconstructions of past land and 
vegetation cover are indispensable to test model-derived hypotheses and 
may aid in the improvement of parameterizations in land surface models. 


Solar and volcanic forcing 

Solar and volcanic activities are considered major external forcings 
of climate variability for the last millennium and beyond***". Several 
studies suggest that short-term changes in solar activity, such as the 
11-year sunspot cycle, may affect monsoon intensity. In some parts of 
the South Asian monsoon domain, the average precipitation anomalies 
in the five most-recent peaks in the sunspot cycle reached values as high 
as 20% above normal, and suggest that solar activity influences mon- 
soon strength*”. A response to the 11-year solar cycle was also found for 
the East Asian monsoon®’, with the band of heaviest rain penetrating 
further north when sunspot numbers were high. However, observa- 
tions of sun—monsoon relationships are based on relatively short time 
series and, hence, are inherently uncertain. High-resolution speleothem 
records from Northern Hemisphere monsoon domains suggest a posi- 
tive correlation between solar activity and Holocene monsoon rainfall 
on longer timescales®****°. Because changes in solar forcing are rather 
small, any noticeable effect on climate requires nonlinear responses and 
amplifying feedbacks*’. ‘Top-down’ processes, based on solar heating 
of the stratosphere and changes in stratospheric ozone concentration 
via modification of photo-dissociation rates, are considered to be cru- 
cial to most climatic phenomena associated with solar forcing®’. Yet 
‘bottom-up’ processes, acting via solar-induced changes in SST, may also 
be important in the tropics, owing to positive feedbacks associated with 
surface evaporation, moisture fluxes and latent-heat release in monsoon 
systems. 

Recent observations, palaeoclimate data and model studies show 
that volcanic forcing is probably more important than solar forc- 
ing on a hemispheric-to-global scale, and drove a large portion of the 
interannual-to-multidecadal monsoon variability during the late 
Holocene by affecting the SST, ENSO and AMO®!*8. The primary 
effect of volcanic sulfate aerosols in the stratosphere is to cool the surface 
of Earth by reflecting incoming solar radiation. As post-eruption cooling 
is generally stronger over land than it is over the ocean, a weaker summer 
monsoon circulation should result from a reduced land-ocean thermal 
contrast (see Fig. 2). Model results suggest that, after large volcanic erup- 
tions, cooling over east Asia is stronger than that over the tropical ocean, 
favouring weaker East Asian summer monsoon circulation?!. However, 
tree-ring data of the past 750 years contradict model results regarding 
the sign of the monsoon response in the year of eruption, and suggest 
an anomalously wet southeast Asia and dry conditions over central 
Asia. On a global scale, large, explosive volcanic eruptions may cause 
reorganizations of the Hadley cell and a shift of the ITCZ away from the 
hemisphere with the greater aerosol concentration”. 

The magnitudes of solar and volcanic forcing and their effects on the 
hydrologic cycle are sources of large uncertainty in most model simula- 
tions, owing to the common model deficiencies in capturing precipitation 
variability at decadal-to-multi-decadal timescales as well as chemical and 
physical processes related to aerosol forcing*’; overcoming these deficien- 
cies in the next generation of climate models is critical. A growing body 
of evidence from palaeoclimatic data underpins a strong influence of 
solar and volcanic forcing on monsoon variability, and may help to better 
identify these model deficiencies. 

Outlook 

Various forcings acting on different timescales affect the monsoon systems 
on a global scale (Figs 2, 4); regional monsoon responses differ from 
global responses in terms of sensitivity and timing, which can be attrib- 
uted to different feedbacks and teleconnections. Palaeoclimatic records 
may help to elucidate the regional characteristics of the different monsoon 
subsystems and to better understand the internal feedback processes 
within the climate system; these feedback processes are suggested to 
influence the recent monsoon circulation more severely than the external 
forcing by increasing GHGs’. In addition, palaeoclimatic records provide 
valuable information that can be used to assess the ability of climate 
models to simulate monsoon changes for different climates, and are of 
utmost importance in testing model-derived hypotheses. For instance, 
a recent model study highlighted the role of differential warming between 
the Northern Hemisphere extratropics and tropics in determining the 
future development of the North African monsoon, where strong extra- 
tropical warming will induce a substantial increase in Sahel rainfall”. 
A large obliquity signature in the North African palaeo-monsoon records” 
strongly supports this model-derived finding, because obliquity forcing 
involves a substantial change in the meridional gradients of insolation and 
heating in the summer hemisphere”! as well as amplification of extrat- 
ropical temperature change via cryosphere expansion and retraction’. 
In a more general sense, it has been suggested that obliquity-induced 
changes have some qualitative similarities with global-warming-induced 
changes, with respect to the summer-hemisphere Hadley cell and ITCZ 
shifts related to mid-latitude eddy activity*!. Therefore, a deeper under- 
standing of obliquity-induced changes in tropical and extratropical cir- 
culation may help to better assess and predict future changes in monsoon 
rainfall. 

Climate models predict that the ongoing interhemispheric thermal 
asymmetry will lead to a northward shift of the mean ITCZ position’, 
but accurate projection of monsoon rainfall suffers from uncertain- 
ties in SST warming. Because the location of the ITCZ is sensitive to 
changes in the meridional SST gradient!?, which can be constrained using 
palaeo-thermometric methods, palaeoclimate reconstructions that involve 
changes in (inter)hemispheric SST gradients provide unambiguous 
evidence for such ITCZ migrations and a blueprint of the associated 
changes in monsoon rainfall patterns. 

Changes in monsoon rainfall can be thermodynamic or dynamic”, and 
may be related to changes in the length of the rainy season, the rainfall 
intensity or the monsoon area. The various aspects of changes in mon- 
soon rainfall require a deeper understanding of the mechanisms behind 
palaeo-monsoon variability, which may be achieved by means of palaeo- 
climatic monsoon reconstructions that use multiple proxy records, ideally 
from the same location”. Combining proxies for wind and precipitation 
changes may help to estimate the relative importance of the dynamic and 
thermodynamic aspects of changes in monsoon rainfall. A promising 
approach for distinguishing past changes in wet-season intensity from 
changes in wet-season length has been proposed, which involves combining 
leaf-wax δD with leaf-wax δ¹³C analyses. 

Palaeoclimatic data might contain further quantitative information on 
the magnitude of change that could help to better project future climate 
variations. Climate model experiments suggest that the observed decrease 
in South Asian monsoon rainfall over the last 70 years is mostly attributa- 
ble to anthropogenic aerosol emissions”®. Water-isotope-enabled climate 
models will be essential for the appropriate interpretation of water-isotope 
records and a quantitative reconstruction of past monsoon rainfall varia- 
bility; however, reducing the uncertainty related to atmospheric aerosol 
loading and to the associated teleconnections, including the ENSO, PDO 
and AMO, is of equal importance for achieving more trustworthy cli- 
mate projections. In particular, it has been shown that eastern and central 
Pacific ENSO events have different effects on the South Asian® and East 
Asian”’ monsoons. Indisputable and continuous records of the different 
states (El Niño and La Niña) and types (central and eastern Pacific) of 
the ENSO and of the PDO and AMO are critical to evaluate the stability 
of monsoon teleconnections. Finally, the middle Pliocene is probably 
the best available analogue for future warming and, thus, palaeoclimate 
records of the warm, high-CO₂ Earth of the Pliocene are urgently needed 
to narrow down uncertainties in future monsoon projections. 
The whole-genome duplication 80 million years ago of the common ancestor of salmonids (salmonid-specific fourth 
vertebrate whole-genome duplication, Ss4R) provides unique opportunities to learn about the evolutionary fate of 
a duplicated vertebrate genome in 70 extant lineages. Here we present a high-quality genome assembly for Atlantic 
salmon (Salmo salar), and show that large genomic reorganizations, coinciding with bursts of transposon-mediated 
repeat expansions, were crucial for the post-Ss4R rediploidization process. Comparisons of duplicate gene expression 
patterns across a wide range of tissues with orthologous genes from a pre-Ss4R outgroup unexpectedly demonstrate far 
more instances of neofunctionalization than subfunctionalization. Surprisingly, we find that genes that were retained as 
duplicates after the teleost-specific whole-genome duplication 320 million years ago were not more likely to be retained 
after the Ss4R, and that the duplicate retention was not influenced to a great extent by the nature of the predicted protein 
interactions of the gene products. Finally, we demonstrate that the Atlantic salmon assembly can serve as a reference 
sequence for the study of other salmonids for a range of purposes. 


The 22,000-year-old cave painting of an Atlantic salmon (Salmo salar) 
near the Vézère River in France is a reminder of our fascination with, 
and dependence on, Atlantic salmon throughout human history. 
Atlantic salmon belongs to the salmonid lineage which comprises 11 
genera, with at least 70 species that exhibit a wide range of ecologi- 
cal adaptations and use a variety of marine and freshwater life history 
strategies’. Salmonids hold important positions as socially iconic 
species and economic resources within aquaculture, wild fisheries and 
recreational sport fisheries. Moreover, they serve as key indicator species 
of the health of North Atlantic and Pacific coastal and river ecosystems. 

All teleosts share at least three rounds of whole-genome duplication 
(WGD), 1R and 2R before the divergence of lamprey from the jawed 
vertebrates’, and a third teleost-specific WGD (Ts3R) at the base of the 
teleosts ~320 million years ago (Mya)*~>. Very little is known about the 
mechanisms of genomic and chromosomal reorganization after WGD 
in vertebrates because the 1R, 2R and Ts3R occurred so long ago that few 
clear signatures of post-WGD reorganization events remain. In contrast, 
a fourth WGD (the Ss4R salmonid-specific autotetraploidization event) 
occurred in the common ancestor of salmonids ~80 Mya after their 
divergence from Esociformes ~125 Mya (Fig. 1), and the continued 
presence of multivalent pairing at meiosis and evidence of tetrasomic 
inheritance in salmonid species suggest that diploidy is not yet fully 
re-established. Salmonids thus appear to provide an unprecedented 
opportunity for studying vertebrate genome evolution after an autotetraploid 
WGD over a time period that is long enough to reveal long-term 
evolutionary patterns, but short enough to give a high-resolution picture 
of the process. In addition, they provide an excellent setting for contex- 
tualizing genome evolution with a dramatic post-WGD species radiation 
and intricate adaptations to a whole range of life history regimes. 

Here we present a high-quality reference genome assembly of the 
Atlantic salmon, and use it to describe major patterns characterizing 
the post-Ss4R salmonid genome evolution over the past 80 million 
years (Myr). Our results challenge the recent claim that rediploidiza- 
tion in salmonids has been a gradual process unlinked to significant 
genome rearrangements’. They also challenge current views about 
the relative importance of sub- and neofunctionalization in vertebrate 
genomes (reviewed in ref. 14), and the importance of dosage balance 
as a gene duplicate retention mechanism. 


Genome characterization 
The Atlantic salmon reference genome assembly (GenBank: 
GCA_000233375.4) adds up to 2.97 gigabases (Gb) with a 


1Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, Ås NO-1432, Norway. 2Department of Biology, University of Victoria, Victoria, British Columbia V8W 3N5, Canada. 3J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, Maryland 20850, USA. 4Department of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås NO-1432, Norway. 5Department of Plant Physiology, Umeå Plant Science Centre, Umeå University, Umeå 90187, Sweden. 6Institute for Physical Sciences and Technology, University of Maryland, College Park, Maryland 20742-2431, USA. 7Department of Molecular Biology, University of Wyoming, Laramie, Wyoming 82071, USA. 8Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania 19122-6078, USA. 9Department of Biology, Temple University, Philadelphia, Pennsylvania 19122-6078, USA. 10Center for Mathematical Modeling, University of Chile, Santiago 8370456, Chile. 11Center for Genome Regulation, University of Chile, Santiago 8370415, Chile. 12Medical Genetics, Oslo University Hospital and University of Oslo, Oslo NO-0424, Norway. 13Department of Virology, Norwegian Veterinary Institute, Oslo NO-0454, Norway. 14Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo NO-0316, Norway. 15CHORI, Oakland, California 94609, USA. 16AquaGen, Trondheim NO-7462, Norway. 17Nofima, Tromsø NO-9291, Norway. 18National Center for Cool and Cold Water Aquaculture, ARS-USDA, Kearneysville, West Virginia 25430, USA. 19Beckman Genomics, Danvers, Massachusetts 01923, USA. 20Courtagen Life Sciences, Woburn, Massachusetts 01801, USA. 21BGI-Shenzhen, Shenzhen 518083, China. 22Laboratory of Molecular Ecology, Genomics, and Evolutionary Studies, Department of Biology, University of Santiago, Santiago 9170022, Chile. 23Faculty of Medicine, University of Chile, Santiago 8380453, Chile. 24Genome Sciences Centre, BC Cancer Agency, Vancouver, British Columbia V5Z 4S6, Canada. 25Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada. 26Department of Informatics, University of Bergen, Bergen NO-6020, Norway. 27Centre for Biodiversity Dynamics, Department of Biology, NTNU - Norwegian University of Science and Technology, Trondheim NO-7491, Norway. 


200 | NATURE | VOL 533 | 12 MAY 2016 


© 2016 Macmillan Publishers Limited. All rights reserved 


[Figure 1: phylogenetic tree with a time axis running from 400 to 0 million years ago. Tips: Spotted gar (Lepisosteus oculatus); Zebrafish (Danio rerio); Stickleback (Gasterosteus aculeatus); Pike (Esox lucius); and, within the salmonids, Coregonus + Prosopium + Stenodus; Thymallus; Brachymystax + Hucho; Oncorhynchus + Salvelinus + Salvethymus; Atlantic salmon (Salmo salar). The Ts3R and Ss4R duplication events are marked on the tree.] 


Figure 1 | Phylogenetic relationship of salmonids and relevant teleost 
lineages. Divergence ages for salmonids are taken from ref. 8 and older 
divergences from ref. 7. Parahucho is not included in the figure due 
to uncertainty of its phylogenetic position. Ages do not represent the 
exact point estimates from the respective studies. Yellow and red circles 
represent the teleost specific whole genome duplication (Ts3R) and 
salmonid-specific whole genome duplication (Ss4R), respectively. 


ctgN50 = 57.6 kb, which is consistent with genome size estimates!®. 
Linkage mapping was used to position and orient 9,447 scaffolds 
(scfN50 = 2.97 megabases (Mb)), representing 2.24 Gb, into 29 sin- 
gle chromosome sequences (Supplementary Table 4). Most scaffolds 
not anchored to chromosomes consist of repetitive sequences. The 
58-60% repeat content of Atlantic salmon is among the highest 
found in any vertebrate. The single largest class of transposable elements 
is the Tc1-mariner family, representing 12.89% of the genome 
(Supplementary Information section 3). Tc1-mariner transposons tend 
to occur in centromeric regions (Fig. 2, track c), as reported in other 
species. 
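
The contig and scaffold N50 values quoted above (ctgN50, scfN50) summarize assembly contiguity. As an illustration only, the following minimal Python sketch computes N50 under its usual definition: the length of the sequence at which the running total, taken from longest to shortest, first reaches half of the summed length. The function name and the toy lengths are hypothetical and are not taken from the salmon assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or scaffold lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)   # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:             # half of the assembly is covered
            return length


# Toy lengths (arbitrary units), for illustration only.
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_lengths)
```

A higher N50 means that more of the assembly sits in long sequences, which is why scaffolding raises the value from tens of kilobases (contigs) to megabases (scaffolds).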

Annotation of gene structures using RNA sequencing (RNA-seq) 
and expressed sequence tags (ESTs) identified 46,598 genes classified 
as non-repeat associated loci with sequence similarity support from 
the PFAM database, and/or zebrafish and stickleback annotations 
(Supplementary Table 11). Functional annotation identified a final 
set of 37,206 high-confidence protein-coding gene loci that have 
been assigned a putative functional annotation based on homology 
within the SwissProt database. Ninety-five per cent of the 498,245 
public ESTs, and 98.3% of the identified loci were mapped to the 
29 chromosome sequences, indicating a nearly complete representa- 
tion of the protein-coding genome (Supplementary Information 
section 1.5). 


Post-Ss4R rediploidization characteristics 

The return of a duplicated genome from tetrasomic to disomic inher- 
itance relies on the obstruction of quadrivalent pairing during meiotic 
cell division. Large chromosome rearrangements through chromosome 
fusions, fissions, deletions or inversions strongly disrupt the possibility 
for homeologous pairing (the pairing of homeologue duplicates arising 
from a WGD). As extensive collinear blocks that include the 
telomere for at least one of the chromosome pairs are diagnostic of 
current or recent multivalent pairing due to sequence homogenization 
(reviewed in ref. 21), we predicted that there would be an inverse rela- 
tionship between homeologous sequence similarity and chromosome 
rearrangements in the duplicated blocks. 

To test this prediction, we identified and analysed 98 homeolo- 
gous (duplicated) blocks with high collinearity by aligning Atlantic 
salmon chromosome sequences against each other (Supplementary 
Information section 2). The 98 blocks (196 regions) account 
for 2.11 Gb (94.4%) of chromosome-anchored sequence (Fig. 2, 
Supplementary Table 6). A large proportion of homeologous blocks, 
representing roughly 573 Mb (25.6% of the chromosome-positioned 
sequence), had a sequence similarity >90%. These regions were clustered 
within seven pairs of chromosome arms (2p-5q, 2q-12qa, 3q-6p, 
4p-8q, 7q-17qb, 11qa-26, 16qb-17qa, and to some extent 9qc-20qb 
and 5p-9qb (Fig. 2)), and are all characterized by large collinear blocks 
including the telomere within at least one of the chromosome pairs. 
Previous studies in salmonids have claimed that at least one metacentric 
chromosome must be involved to provide the stability required 
for the formation of multivalents and homeologous pairing. Our 
findings for regions 11qa-26 and 16qb-17qa indicate that this is not 
a strict necessity. Notably, increased read alignment depth and shorter 
scaffolds were characteristic of regions exceeding 95% similarity, 
representing 210 Mb (9.4% of the chromosome-positioned sequence), 
suggesting assembly collapse (Fig. 2, Supplementary Information 
section 1.5). 
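
The percentages in this paragraph can be checked against the 2.24 Gb of chromosome-anchored sequence reported earlier. The short Python sketch below recomputes them from the rounded figures given in the text; the variable and function names are hypothetical. Because the inputs are rounded, the share for all 98 blocks comes out near 94.2% rather than exactly the 94.4% stated above, a small difference that is presumably due to rounding of the published values.

```python
ANCHORED_MB = 2240.0  # 2.24 Gb of chromosome-anchored sequence, from the text


def percent_of_anchored(span_mb, anchored_mb=ANCHORED_MB):
    """Share of the chromosome-anchored sequence covered by a span, in per cent."""
    return 100.0 * span_mb / anchored_mb


# Spans as quoted in the text, converted to megabases.
spans_mb = {
    "all 98 homeologous blocks": 2110.0,
    "blocks with >90% similarity": 573.0,
    "blocks with >95% similarity": 210.0,
}

shares = {name: round(percent_of_anchored(mb), 1) for name, mb in spans_mb.items()}
```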

Without exception, duplicated regions exhibiting rearrangements at 
telomeres in the form of inversions, translocations or larger deletions 
all displayed a sequence similarity of ~87%. This clear correspond- 
ence between the degree of intra-block sequence similarity and blocks 
predicted to still participate in tetrasomic inheritance (or recently have 
done so) suggests that up to 25% of the salmon genome experienced 
delayed rediploidization after the initial large chromosome rearrange- 
ments, and that as much as 10% of the genome may still retain residual 
tetrasomy (Supplementary Table 7). The large and highly collinear 
blocks of shared synteny between Atlantic salmon and rainbow trout 
(Extended Data Fig. 1) imply that these rearrangements must have 
taken place before the split of the two lineages. This is also supported 
by combined genome mapping and karyotyping studies in other mem- 
bers of the Salmoninae subfamily, documenting conservation of large 
blocks embracing whole chromosome arms””. 

To scrutinize this further, we analysed a set of 2,487 gene trees 
from orthologous gene sets containing putative homeologous pairs 
for both Atlantic salmon and rainbow trout (Oncorhynchus mykiss) 
(Supplementary Information section 5). As this analysis required 
calibration against an outgroup, we included only homeologous 
pairs having an orthologue in the Northern pike (Esox lucius), a 
member of the closest related diploid sister-group to salmonids”*. 
Our results suggest ~100-80 Mya as a lower boundary for the Ss4R 
and that the Salmo-Oncorhynchus divergence occurred ~21 Mya 
(Fig. 3b; Extended Data Fig. 2c and Supplementary Information 
section 6), in agreement with recent age estimates*"’. Interestingly, 
analysis of asymmetry in coding sequence evolution between home- 
ologues showed that a major part of the sequence divergence hap- 
pened since the Salmo-Oncorhynchus split, suggesting a considerable 
temporal decoupling between the Ss4R event and sequence diver- 
gence of the Ss4R duplicates (Supplementary Information section 6). 
Moreover, our molecular dating results suggest that the majority of 
the Ss4R duplicates returned to disomic inheritance in a common 
ancestor of all salmonids before ~60 Mya (Fig. 3c). The results from 
the gene tree analysis are thus consistent with the data on homeolo- 
gous sequence similarity (Extended Data Fig. 2b), strongly suggest- 
ing that large genomic reorganizations have been instrumental for 
the rediploidization process following the Ss4R. Our findings thus 
challenge one of the main conclusions from the recent sequencing of 
the rainbow trout genome, which suggested that rediploidization in 
salmonids has been a gradual process unlinked to significant genome 
rearrangements’. 

Considering possible mechanisms underlying these large genomic 
reorganizations, the distribution of major transposon families in the 
Atlantic salmon genome suggests transposable element expansion in 
an ancestral salmonid before the chromosome fusions occurring in the 
Atlantic salmon lineage (Fig. 2, track c). The 85% sequence divergence 
among a large number of transposon family members is compara- 
ble to the lower boundary of homeologue block similarity (~87%). 
Assuming comparable neutral clock-like sequence divergence, this 
correspondence is consistent with a burst of repeat expansions coin- 
ciding with the initiation of rediploidization post-Ss4R (Fig. 3a and 
Extended Data Fig. 2b and Supplementary Information section 6.2). 
As large-scale expansion and movement of transposable elements are 
known to increase under genomic stress, this may suggest that Ss4R 
caused transposable element expansion by compromising regulatory 
processes responsible for transposon policing. This expansion might 
in turn have been a major determinant for driving the genome towards 
a diploid state through enhanced homeologue sequence divergence 
and large chromosome rearrangements due to ectopic transposable 
element recombination and chromosomal breakage causing 
non-homologous end-joining (Fig. 3c). 


Duplicate retention: patterns and mechanisms 

To assess the evolutionary fates of duplicated genes in the salmon 
genome, we analysed patterns of Ss4R duplicate retention and functional 
divergence of protein-coding genes within the 98 homeologous 
blocks. Considering that we find very little evidence for gene 
loss through fractionation, and that in 56% of the 9,162 singletons 
we were able to identify a pseudogenized homeologue gene fragment 
in an expected position (Supplementary Information section 4 and 
Supplementary Table 11), pseudogenization appears to be the predominant 
mechanism underlying Ss4R duplicate loss. 

Figure 2 | The duplicated Atlantic salmon genome. Homeologous 
regions in the Atlantic salmon genome subdivided into 98 collinear blocks 
along the 29 European Atlantic salmon chromosomes. Red rectangles 
represent blocks of sequence without identifiable duplicated regions 
elsewhere in the genome. a, This track shows grouping of salmon sequence 
into regions; red = high (>95% sequence similarity), orange = elevated 
(90-95% sequence similarity), green = low (~87% sequence similarity), 
yellow = telomeric regions (10 Mb) characterized by highly elevated male 
recombination (see ref. 10). b, This track shows genomic similarity (in 
1 Mb intervals) between duplicated regions (red = high, yellow = medium, 
green = low sequence similarity). c, This track shows frequency of 
Tc1-mariner transposon elements in the Atlantic salmon genome. 


To contrast the Ss4R with the 240 Myr older Ts3R duplicate reten- 
tion patterns, we analysed duplicate retention patterns in teleost gene 
family trees (ref. 27; Supplementary Information section 8). This 
revealed that 20% of the Ts3R and 55% of the Ss4R duplicates are 
retained as two functional copies in Atlantic salmon. In comparison, 
12-24% of duplicated genes derived from the Ts3R event have been 
retained in other extant teleost fish lineages (reviewed in ref. 28), and 
the retention 75 Myr post-Ts3R has been estimated to have been about 
40%*"°. Considering the uncertainty attached to such estimates, the 
post-Ss4R temporal retention profile of Atlantic salmon is arguably 
quite similar to that of other teleosts post-Ts3R, indicating that mech- 
anisms responsible for duplicate retention in Atlantic salmon may be 
generic. 

Surprisingly, Atlantic salmon genes that were retained as duplicates 
after the Ts3R event were not more likely to be retained after the Ss4R 
(Extended Data Fig. 3; Supplementary Information section 8). The pre- 
dominantly independent probabilities of retention suggest a complex 
interplay of processes, different evolutionary drivers of duplicate reten- 
tion, or a largely neutral and stochastic nonfunctionalization process 
following the Ts3R and Ss4R events. Interestingly, we observed enhanced 
retention of non-WGD gene duplicates (older or younger than the Ss4R 
event) when the WGD (both Ts3R and Ss4R) duplicates also had been 
retained (P < 0.001; Supplementary Information section 8). 

Figure 3 | Post-Ss4R rediploidization. a, Fig. 3a shows a significant 
and ongoing expansion of transposable elements from the Tc1-mariner 
superfamily with major peaks at an average of 87%, 93% and 98% 
similarity between family members. The colours correspond to the same 
colours as in the box plot in Extended Data Fig. 5. b, Age estimates of the 

Two major mechanisms by which a pair of duplicates can escape 
the fate of nonfunctionalization are subfunctionalization (partition- 
ing of ancestral gene functions)*° and neofunctionalization (assign- 
ing a novel function to one of the duplicates)*!. To assess the relative 
importance of these two mechanisms we analysed gene expression 
divergence of Ss4R duplicates across 15 tissues (Extended Data 
Fig. 4a, b; Supplementary Information section 7). Forty-five per cent 
(3,991/8,954) of well-defined expressed Ss4R pairs showed signs of 
diverged expression by being located in different co-expression clus- 
ters (Fig. 4a). Diverged homeologues tended to belong to closely 
related but still different co-expression clusters (Fig. 4a and Extended 
Data Fig. 4d). 

Although these results suggest that functional divergence is com- 
mon among Ss4R duplicates, information about ancestral state is 
critical for the classification of this divergence into sub- and neofunc- 
tionalization. We therefore used comparable expression data across 13 
common tissues from diploid Northern pike”? as a proxy for the ances- 
tral state of Ss4R duplicates. We identified 8,102 orthologous gene 
triplets (that is, two Ss4R copies and their putative pike orthologue) 
and in 42% of the triplets both Ss4R duplicates showed conserved 
co-expression profile with the pike orthologue (Pearson correlation 
>0.6, P< 0.03). This indicates strong purifying selection pressure on 
gene regulation across more than 100 Myr and adds credibility to the 
use of Northern pike for assessing ancestral gene regulation. In 28% of 
the triplets, one Ss4R duplicate had a conserved co-expression pattern 
with pike and the other belonged to a different co-expression cluster 
(Fig. 4b), indicative of regulatory neofunctionalization. 

Although we observed cases of putative pseudogenization in Ss4R 
duplicates displaying a low correlation in expression regulation in 
combination with large coding sequence length difference, most Ss4R 
duplicates had similar lengths regardless of their expression similarity 
(Extended Data Fig. 4e), suggesting that neutral evolution can only 
marginally explain this regulatory divergence. 

We identified 1,084 triplets where the salmon duplicates belonged to 
different expression clusters and had expression profiles significantly 
different from pike (Pearson correlation <0.55, P > 0.05), pointing 
to possible subfunctionalization. In this group we found, somewhat 
surprisingly, only 23 clear examples of subfunctionalization where 
the sum of the expression patterns of salmon homeologues correlated 
significantly with assumed ancestral state. However, this cluster-based 
analysis neglects subtler within-cluster subfunctionalization cases, as 
well as those involving acquisition of novel functions after subfunc- 
tionalization. To account for this, we applied an ‘on-off’ classifica- 
tion method (Extended Data Fig. 4f and Supplementary Information 
section 7.2) that increased the estimate to 167 cases; a figure that is 
still dwarfed by the estimated number of neofunctionalization cases 
(3,028) (Supplementary Information section 7.2). 
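
The logic of sorting triplets into conserved, neofunctionalized and subfunctionalized classes can be sketched in a few lines of Python. This is a simplified illustration and not the analysis used in the paper, which relies on co-expression clusters and an 'on-off' classification. The sketch assumes that each gene is represented by a vector of expression values across shared tissues, reuses the 0.6 Pearson threshold quoted above as a single cut-off, and ignores significance testing. All function names and the toy expression vectors are hypothetical.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat profile carries no correlation signal
    return cov / sqrt(var_x * var_y)


def classify_triplet(dup1, dup2, pike, conserved_r=0.6):
    """Classify one triplet (two Ss4R copies plus the pike orthologue)."""
    conserved1 = pearson(dup1, pike) > conserved_r
    conserved2 = pearson(dup2, pike) > conserved_r
    if conserved1 and conserved2:
        return "both conserved"
    if conserved1 or conserved2:
        return "candidate neofunctionalization"
    # Neither copy alone resembles the assumed ancestral state:
    # test whether the summed expression of the two copies does.
    summed = [a + b for a, b in zip(dup1, dup2)]
    if pearson(summed, pike) > conserved_r:
        return "candidate subfunctionalization"
    return "both diverged"


# Toy expression vectors across six tissues, for illustration only.
pike_profile = [5, 5, 5, 5, 0, 0]
label_sub = classify_triplet([5, 5, 0, 0, 0, 0], [0, 0, 5, 5, 0, 0], pike_profile)
label_neo = classify_triplet([5, 5, 5, 5, 0, 0], [0, 0, 0, 0, 5, 5], pike_profile)
```

In the toy data the first pair splits the ancestral expression domain between the two copies, so only their sum matches pike, whereas in the second pair one copy keeps the ancestral pattern and the other has moved to different tissues.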

Purifying selection on dosage sensitive interactions with other 
duplicated genes is thought to be an important mechanism for inter- 
mediate duplicate retention after WGDs!°, before neo-, sub- and 
nonfunctionalization determine the ultimate fate of the duplicates*”. 
In line with this, we observed an overrepresentation of GO terms 
associated with signal transduction, protein complex formation and 
transcription among the duplicated genes with conserved regulation 
(Supplementary Information section 7.3 and Supplementary 
Table 16). However, as a diversity of GO terms not focal to the dosage 
balance hypothesis (Supplementary Table 16) are also overrepresented 
among Ss4R duplicates with conserved regulation, it is not 
justified to conclude that dosage balance is the sole intermediate 
retention mechanism. Furthermore, analyses of retention patterns 
after Ts3R and Ss4R suggest independent retention probability and 
a very weak effect of preferential co-retention of known protein 
interacting partners (P < 0.001) for both the Ts3R and Ss4R duplication 
events (Extended Data Fig. 3 and Supplementary Information 
section 8). 

[Figure 4 panel labels: Diverged, Conserved, Conserved; Skin + muscle (B), Nose + gill (C), Kidney (D), Gut + PC (E).] 

Figure 4 | Homeologue divergence. a, Circos plot distribution of 
homeologous gene pairs and their assignment to 11 co-expression clusters 
based on 15 different tissues. Lines connect Ss4R pairs that belong to 
different co-expression clusters. For visualization purposes, we sorted the 
Ss4R pairs according to type of co-expression divergence. Red lines signify 

Taken together, >60% of the homeologue pairs show signatures of 
tissue-dependent regulatory divergence at the whole gene or exon- 
level (Supplementary Information section 7.2). The predominance of 
cases where only one copy has changed its regulation compared to the 
assumed ancestral state indicates that regulatory subfunctionalization 
has not been a dominant duplicate retention mechanism post Ss4R, 
unless it was followed by subsequent neofunctionalization, which has 
been suggested as a common process. However, our subfunctionalization 
estimates together with the high frequency of triplets where 
one salmon homeologue had a conserved co-expression pattern with 
pike while its duplicate did not (Fig. 4b), are not consistent with the 
generality of this latter scenario. 


A reference genome for salmonids 

Conservation of synteny between salmonids (refs 22, 35) suggests that information
from one high-quality salmonid genome can be used to improve
genome sequence assemblies of other salmonids. To test the feasibility
of such a comparative genomics approach, we used the Atlantic salmon
assembly to construct chromosome sequences for the non-chromosome
anchored rainbow trout genome sequence (ref. 13). We were able to map
99.5% of rainbow trout scaffolds >100 kilobases (kb) (total 1.22 Gb)
to the Atlantic salmon chromosome sequences (Supplementary
Information section 1.5).

significant resampling tests (P < 0.05) for enrichment of homeologue
divergence between two specific co-expression clusters. b, Heatmap of 
2,272 triplets (two salmon homeologues and a pike orthologue), in which 
one of the Atlantic salmon homeologues has diverged in gene expression 


Using the Atlantic salmon chromosome sequences together with a 
dense linkage map for rainbow trout constructed from a 57K single
nucleotide polymorphism (SNP) array, we were able to anchor, orient
and concatenate 11,335 rainbow trout scaffolds (scaffold N50 = 940 kb, from
ref. 13) into 29 rainbow trout chromosome sequences (Supplementary 
Information section 9). This was done by first using the rainbow trout 
linkage map to determine the proximate order of 2,439 trout scaf- 
folds containing SNPs, which we found to be sufficient for determin- 
ing conserved blocks. Then we used comparative information from 
Atlantic salmon to incorporate scaffolds without SNP information, 
and fine-tune the order and orientation of all 11,335 trout scaffolds 
into chromosome sequences. Even though the rainbow trout linkage 
map contains more markers than most other salmonids (for example, 
ref. 22), this high number of properly placed scaffolds would not be 
achievable without the Atlantic salmon information. 

Alignment of these rainbow trout chromosomes (representing 
1.37 Gb of sequence) with the Atlantic salmon genome revealed con- 
servation of very large syntenic blocks, in many cases correspond- 
ing to whole chromosome arms in rainbow trout (Extended Data 
Fig. 1). This analysis supports previous results suggesting conservation
of 50 syntenic regions representing the karyotype of 50 acrocentric
chromosomes in the common ancestor of salmonids. Our
analysis documents that these syntenic regions typically represent 
blocks with no rearrangements for 38 regions and with only one or 
two inversions or translocations among the remaining parts. 


Implications 

The conservation of large collinear blocks between Salmo and 
Oncorhynchus strongly suggests that the Atlantic salmon genome infor- 
mation will facilitate exploitation of genomic information in a wide 
range of ecological, evolutionary, conservation and production biology 
settings within salmonids. Moreover, the availability of a high-quality 
assembly and annotation of the Atlantic salmon genome provides novel 
insights into vertebrate post-WGD evolution that may contribute to a 
more thorough understanding of the underlying mechanisms as well 
as the long-term importance of WGD for adaptation.

METHODS


Data reporting. No statistical methods were used to predetermine sample size. 
The investigators were not blinded to allocation during experiments and outcome 
assessment. 

Genome sequencing and assembly. DNA from a single double-haploid female 
from the AquaGen strain, produced by mitotic androgenesis, served as the tem- 
plate for sequencing using Sanger and next generation sequencing technologies 
(Supplementary Table 1). Various assemblies were generated using different 
combinations of software and subsets of data (Supplementary Table 2). The foundation
of the chosen assembly was generated from Sanger (~4×) and Illumina
(~202×) data assembled using the MaSuRCA (v2.0.3) assembler. The assembly
was reconciled and gap-filled using information from preliminary assemblies
(Supplementary Information section 1.3). Genetic linkage information describing 
565,877 SNPs was used to both confirm and correct scaffolds and, when supported 
by information from other assemblies, was used to join scaffolds within linkage 
groups. Subsequently, linkage analysis using CRIMAP and a subset of SNP
sequence tags (27,221) were used to order, orient and concatenate scaffolds into 29 
single-chromosome sequences. Nomenclature for Atlantic salmon chromosomes 
is based on ref. 35. 

Gene annotation. Gene structures were determined by combining data from
full-length cDNA sequences, EST databases, and RNA-seq data from 15 tissues
(Supplementary Table 9). RNA-seq reads were trimmed using Trimmomatic (v0.32
(ref. 42)) and mapped to the reference genome sequence using STAR (v2.3.1z12
(ref. 43)), and all publicly available mRNAs and ESTs were mapped using GMAP.
Gene structures were predicted with CUFFLINKS. Open reading frame (ORF)
predictions were carried out using TransDecoder. Gene models without a homology
match to either PFAM, stickleback or zebrafish were discarded. Functional
annotation was done with Blast2GO against the SwissProt database. Transposable
element related ORFs were identified with BLAST searches against the annotated
transposable element sequences and queries in the functional annotation gene
names for transposable element related terms (that is, retrotransposon, transposon,
transposable, transposase, reverse transcriptase, gag, pol). Putative expressed
and silenced Ss4R homeologues were identified using a combination of homology
searches with BLAST and GenomeThreader targeting a priori defined conserved
collinear duplicated regions (n = 98).

Repeat library methods. An Atlantic salmon repeat library of 2,005 elements
was assembled from sequences previously reported in salmonids and the
output of the de novo repeat-finding programs LTRharvest, RepeatModeller and
REPET. With the exception of curated repeats previously reported by Matveev
and Okada and those found in the RepBase database, all preliminary sequences
were validated using BLASTn to ensure that they were present at multiple locations
in the genome. LTRharvest sequences were filtered based on the repeat library
construction procedure outlined in the MAKER documentation. Using BLASTn,
sequences from other de novo sources and the rainbow trout repeat library were
flagged as potentially chimaeric if they did not generate at least three high-scoring
segment pairs (HSPs) covering at least 80% of their length in the Atlantic salmon
genome. Any distinct highly repetitive region within such sequences was extracted
and retained while other portions were discarded. All libraries were merged and
redundant sequences were removed based on the guidelines presented by Wicker
et al. and the MAKER documentation. Sequences in the combined library were
annotated, and non-transposable element host genes were removed based on
their similarity to well-characterized sequences in annotation databases, the
presence of structural motifs and manual examination.

To estimate the historical activity of Tc1-mariner transposable elements, up to
100 randomly selected full-length genomic copies from each of 40 Tc1-mariner
families were extracted and aligned using MUSCLE. All families were confirmed
to be phylogenetically distinct from each other and possessed a star-like
neighbour-joining tree topology characteristic of Tc1-mariner activity. The
distribution of pairwise per cent similarity, a proxy for time, between members of a
family was used to analyse the temporal dynamics of transposable element activity.

Identification of homeologous blocks within the salmon genome. Repeat
masked chromosome sequences for Atlantic salmon (see above) were aligned
against each other using LASTZ to identify 98 homeologous blocks originating
from the Ss4R (for details see Supplementary Information section 2). Sequence
similarity between homeologous sequences was determined in 1 Mb intervals
by averaging local percentage of nucleotide sequence identity using high-scoring
segment pairs (HSPs) from LASTZ alignments and presented as a Circos plot
in Fig. 2.
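
The windowed averaging described above can be sketched as follows. This is a minimal illustration and not the pipeline used in the study: the function name, the `(start, identity)` input layout, the assignment of each HSP to a window by its start coordinate, and the use of an unweighted mean are all assumptions made for the example.

```python
from collections import defaultdict

WINDOW = 1_000_000  # 1 Mb interval, as in the text


def mean_identity_by_window(hsps, window=WINDOW):
    """Average per-HSP identity within fixed-size windows.

    hsps: iterable of (start_position, percent_identity) tuples.
    Assumption: each HSP is assigned to the window containing its start
    coordinate and all HSPs are weighted equally.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for start, identity in hsps:
        index = start // window
        sums[index] += identity
        counts[index] += 1
    return {index: sums[index] / counts[index] for index in sorted(sums)}


# Hypothetical HSPs: two in the first window, one in the second.
example = [(100, 90.0), (500_000, 94.0), (1_200_000, 88.0)]
profile = mean_identity_by_window(example)
```

A length-weighted mean would be an equally reasonable reading of "averaging local percentage of nucleotide sequence identity"; the text does not say which was used.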

Sequence evolution analyses of salmon homeologues. Putative orthologue 
sequence sets were collated with Best Reciprocal Blast (BRB) protein matches. 
For salmonid species the top-two BRB-hits were assigned to putative orthologue 
groups. Multiple codon sequence alignments were constructed using MAFFT
and quality trimmed with Guidance in an iterative framework where sequences
were re-aligned after identification of poorly aligned codons.

Maximum likelihood (ML) gene trees were calculated by the R-package
Phangorn using codon alignments, the GTR+G+I model, and 100 bootstrap
replicates. Branch specific GTR+G+I substitution rates were estimated using
functions from the R-package ape, while branch specific synonymous (dS) and
non-synonymous (dN) substitution rates were estimated with non-negative least squares
regression in the Phangorn R package using pairwise dN and dS distance matrices
from codeml and the ML gene tree topologies as input.

Branch-site specific test for positive selection was carried out by a likelihood- 
ratio test on the ML-likelihood estimates for sequence evolution under different 
models in codeml. The smallest likelihood estimate from four omega starting val- 
ues (0.5, 1, 1.5, and 2) was used in the likelihood ratio test (LRT). False discovery 
rate adjustments of p-values were done with the p.adjust function in R. 
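
As an illustration of the testing step, the sketch below computes a likelihood-ratio statistic from two log-likelihoods and adjusts a set of P values. It is not the code used in the study. Two assumptions are made: the statistic is compared against a chi-square distribution with one degree of freedom, and the adjustment is Benjamini-Hochberg (the text names p.adjust but not the method). Function names are hypothetical.

```python
import math


def lrt_p_value(lnl_null, lnl_alt):
    """Likelihood-ratio test for nested models.

    Returns (statistic, p_value). Assumes one degree of freedom, for which
    the chi-square survival function is erfc(sqrt(x / 2)).
    """
    stat = max(0.0, 2.0 * (lnl_alt - lnl_null))
    return stat, math.erfc(math.sqrt(stat / 2.0))


def benjamini_hochberg(pvals):
    """Benjamini-Hochberg adjusted P values, returned in the input order."""
    m = len(pvals)
    order = sorted(range(m), key=lambda i: pvals[i], reverse=True)
    adjusted = [0.0] * m
    running_min = 1.0
    for offset, i in enumerate(order):
        rank = m - offset  # rank of pvals[i] in ascending order (1-based)
        running_min = min(running_min, pvals[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Hypothetical log-likelihoods for a null and an alternative model.
stat, p = lrt_p_value(-1000.0, -998.0795)
adjusted = benjamini_hochberg([0.01, 0.04, 0.03])
```

Branch-site tests are sometimes evaluated against a mixture distribution instead; the plain chi-square used here is the more conservative choice.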

Gene tree dating. BEAST was used to calibrate gene trees using a HKY+G substitution
model, uncorrelated lognormal clock, and Yule tree prior. The BEAST
analyses were exclusively based on codon alignments that produced a ML-gene tree 
topology containing two Ss4R homeologues in both Salmo and Oncorhynchus, and 
where rediploidization had occurred before the Salmo-Oncorhynchus divergence. 
No priors on tree topology were specified and a single secondary calibration of 
127 Myr (confidence interval 12.5 Myr) on the most recent common ancestor 
of Salmoniformes + Esociformes was used. All Markov chain Monte Carlo
(MCMC) analyses were run for 10 million generations with sampling every 1,000
generations. Tracer v1.6 (available from http://beast.bio.ed.ac.uk/Tracer) was used 
to inspect effective sample sizes (ESS) of tree parameters. Fifty per cent consen- 
sus topologies were constructed based on 100 randomly sampled tree topologies 
from the last 1,000 MCMC-samples. Age of Salmo-Oncorhynchus divergence was 
estimated as the median of two nodes per tree. 

Transcriptome analysis. A gene was classified as ‘expressed’ if the FPKM value of
at least one tissue was above 1.0, and values were transformed to log2(FPKM + 1)
values for subsequent analysis. Samples and genes were clustered using Pearson
correlation and Ward’s method in the R function hclust, and visualized as heatmaps
using the R function heatmap.2 (gplots library). Genes were scaled individually
in the heatmaps.
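
The filtering and transformation steps can be illustrated with a short sketch. The function names are hypothetical and the code is not taken from the study; the hierarchical clustering itself (Ward's method in hclust) is not reproduced here, only the expression filter, the log2(FPKM + 1) transform and the Pearson correlation on which the clustering is based.

```python
import math


def is_expressed(fpkm_by_tissue, threshold=1.0):
    """True if the FPKM value of at least one tissue is above the threshold."""
    return any(value > threshold for value in fpkm_by_tissue)


def log2_fpkm(fpkm_by_tissue):
    """Transform FPKM values to log2(FPKM + 1)."""
    return [math.log2(value + 1.0) for value in fpkm_by_tissue]


def pearson(x, y):
    """Pearson correlation between two equal-length expression profiles."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0.0 or syy == 0.0:
        return float("nan")  # correlation is undefined for a flat profile
    return sxy / math.sqrt(sxx * syy)


# Hypothetical FPKM profiles across three tissues.
gene_a = log2_fpkm([0.0, 1.0, 3.0])
r = pearson([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
```

A distance such as 1 - r could then be passed to a hierarchical clustering routine; whether the study used exactly that distance is an assumption.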

Clusters with a significant number of shared homeologue-pairs were identi- 
fied by simulation (10,000 randomizations). A salmon gene (or exon) was classi- 
fied as conserved if the Pearson correlation to the pike orthologue was above 0.6 
(P=0.03) across the 13 common tissues, and diverged if the correlation was below 
0.55 (P>0.05). A salmon homeologue-pair was classified as neofunctionalized if 
at least one salmon gene was conserved and the two salmon genes were in differ- 
ent clusters, and as subfunctionalized if both salmon genes were diverged and in 
different clusters, but their summed expression was conserved. 
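
The classification rules above can be written out as a small decision function. This is a sketch under stated assumptions rather than the study's code: the names are hypothetical, genes with a correlation between 0.55 and 0.6 are left unclassified, and "summed expression was conserved" is interpreted as the correlation between the summed salmon profile and the pike orthologue exceeding the same 0.6 threshold.

```python
def classify_gene(r, conserved_above=0.6, diverged_below=0.55):
    """Classify one salmon gene from its Pearson correlation to pike."""
    if r > conserved_above:
        return "conserved"
    if r < diverged_below:
        return "diverged"
    return "unclassified"  # assumption: the 0.55-0.6 band is left unassigned


def classify_pair(r1, r2, cluster1, cluster2, r_sum):
    """Classify a homeologue pair.

    r1, r2: correlation of each salmon copy to the pike orthologue.
    cluster1, cluster2: co-expression cluster labels of the two copies.
    r_sum: correlation of the summed salmon expression to pike (assumed
           to be how conservation of summed expression is assessed).
    """
    state1, state2 = classify_gene(r1), classify_gene(r2)
    different_clusters = cluster1 != cluster2
    if different_clusters and "conserved" in (state1, state2):
        return "neofunctionalized"
    if (different_clusters and state1 == state2 == "diverged"
            and classify_gene(r_sum) == "conserved"):
        return "subfunctionalized"
    return "other"


# Hypothetical pairs.
pair_a = classify_pair(0.8, 0.2, "A", "H", 0.5)
pair_b = classify_pair(0.3, 0.2, "A", "H", 0.75)
```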

Expression specificity was computed as one minus the sum, over all samples,
of the gene’s expression in that sample divided by the maximum expression in any
sample. Significant difference in specificity between clusters was computed using
the Wilcoxon test.

Duplicate retention. Existing gene families for all teleost species were downloaded
from Ensembl Compara 79 (ref. 27). Genomes for Salmo salar, Esox lucius, and
Oncorhynchus mykiss were added to these gene families or used to create new gene
families with BLAST to determine homologous relationships (e-value >1e-10
and %id>50). Multiple sequence alignments of extended gene families with
Lepisosteus oculatus as an outgroup were produced using MAFFT (command
line option -auto) and gene trees were built with PhyML 3.4 (ref. 68) using the
JTT+G substitution model. Using the NCBI teleost species tree, Softparsmap was
used to identify duplication and speciation events in trees. This resulted in 12,388
gene families with a speciation root node, encompassing 26,325 salmon genes. 

The constructed gene trees were then assessed for duplicate retention for the 
Ts3R, Ss4R, small scale salmon specific duplications (SSD) following the Ss4R 
event, and duplications occurring between the Ts3R and Ss4R. Duplicate retention 
was counted by examining the conditional percentages of genes that were retained 
from the Ss4R following the Ts3R, and from the Ss4R to small-scale duplications 
on the salmon lineage. The duplication lineage for each gene was counted, ensur- 
ing that each lineage accounted for the retention or loss of a duplicate, with the 
expectation that each Ts3R duplication should give rise to two Ss4R, and every 
Ss4R should lead to two small scale duplications. Post3R-preSs4R SSDs also share 
an expectation of having resulted in two Ss4R duplications. Where nodes could 
be assigned as being either Ss4R or SSD, the chromosomal locations of the genes 
were used to differentiate between the ambiguous nodes. Such ambiguous nodes 
were determined to be SSDs if the duplicate salmon genes resided on the same 
chromosome; otherwise they were classified as being Ss4R. Since only a single Ss4R
duplication occurred along a lineage, if two ambiguous nodes were found that 
could be classified as Ss4R along the same lineage, one was classified as being Ss4R
and the rest were classified as being SSD, with the oldest duplication being the Ss4R,
an assumption that did not affect the trends in the data. Although most gene tree 
topologies were consistent with the teleost species tree, some gene trees showed 
large deviations from the accepted species tree. These trees may have been influ- 
enced by phylogenetic error which could cause spurious duplication counts and 
cause an overestimation of the number of duplication events within a gene family. 
Conditional probabilities were then calculated to determine the fraction of retained 
gene duplicates following each of the WGDs, given the opportunity for retention. 

To assess if duplicate retention was impacted by protein-protein interac- 
tions, known protein-protein interactions were downloaded from the STRING 
database. BLAST against Danio rerio was performed and putative STRING interactions
in salmon were determined. Only interactions labelled ‘binding’ were kept,
which are putative physical protein-protein interactions based on various forms 
of evidence. Patterns of co-retention following Ts3R, Ss4R, and SSD were then 
examined among STRING binding partners using the phylogenetic trees described 
above with custom perl scripts. 

Statistical tests of significance were performed to determine if duplication 
counts were significantly different from each other. The duplication process was 
represented by a binomial distribution where each duplication could have either 
been retained or not. A two-proportion pooled z-test was performed to calculate
two-sided P values at the Bonferroni corrected α-level (0.001/7). To further
explore whether the results reflected only a marginal change in effect level or were
overly influenced by large sample sizes, an odds ratio and relative risk analysis was
performed for each group and two-sided P values were calculated. All tests showed
extremely low P values indicating that the groups were significantly different from
one another. Effect sizes were considered as the fractional change in mean values.
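
For readers unfamiliar with the test, a two-proportion pooled z-test with the Bonferroni-corrected threshold given above can be sketched as follows. The function name and the counts in the example are hypothetical and do not come from the study; only the form of the test and the threshold (0.001/7) are taken from the text.

```python
import math

ALPHA = 0.001 / 7  # Bonferroni-corrected level stated in the text


def two_proportion_z_test(x1, n1, x2, n2):
    """Pooled two-proportion z-test; returns (z, two-sided P value).

    x1, x2: number of retained duplicates in each group.
    n1, n2: number of duplication events (opportunities) in each group.
    """
    p1 = x1 / n1
    p2 = x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    se = math.sqrt(pooled * (1.0 - pooled) * (1.0 / n1 + 1.0 / n2))
    if se == 0.0:
        raise ValueError("pooled proportion is 0 or 1; z is undefined")
    z = (p1 - p2) / se
    # Two-sided tail probability of the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return z, p_value


# Hypothetical counts: 80 of 100 retained versus 20 of 100 retained.
z, p_value = two_proportion_z_test(80, 100, 20, 100)
significant = p_value < ALPHA
```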

All scripts used in this analysis are freely available on the Liberles Group 
website at Temple University (USA) at https://liberles.cst.temple.edu/public/ 
Salmon_Genome_Project/. 

Use of salmon assembly to improve rainbow trout genome sequence. Salmon 
chromosome sequences were repeat masked using a salmon repeat database and 
RepeatMasker v4.0.3 (ref. 72) and aligned against rainbow trout scaffolds (ref. 13) using
MegaBLAST. Rainbow trout scaffolds mapping to multiple salmon chromosomes
were broken when supported by information from a rainbow trout linkage map
containing 31,390 SNPs constructed in a family material of 2,464 individuals using
Lep-MAP. The relative positions of trout scaffolds within the salmon genome
were used, together with trout linkage maps, to position, orient and concatenate
11,335 rainbow trout scaffolds into 29 single chromosome sequences (1.37 Gb).
Nomenclature for rainbow trout chromosomes is based on ref. 35. Conserved syntenic
blocks between rainbow trout and Atlantic salmon were determined by aligning
chromosome sequences for the two species against each other using LASTZ.

Extended Data Figure 1 | Atlantic salmon and rainbow trout comparative map. Alignment of Atlantic salmon (Salmo salar) and rainbow trout
(Oncorhynchus mykiss) chromosome sequences using LASTZ demonstrates conservation of large collinear syntenic blocks between the two species. 


© 2016 Macmillan Publishers Limited. All rights reserved 




Extended Data Figure 2 | Dating of Ss4R rediploidization. a, Schematic representation of a gene tree topology reflecting rediploidization of Ss4R homeologues before Salmo-Oncorhynchus divergence. b, Correlation between genomic similarity in 1 Mb windows and Ss4R rediploidization age. c, Distributions of Salmo-Oncorhynchus divergence age and Ss4R divergence age from time calibrated gene trees estimated with BEAST. Modes of each distribution are indicated with a vertical line. d, Correlation between estimated age of Salmo-Oncorhynchus divergence and Ss4R divergence age.

Extended Data Figure 3 | Duplication count analysis and interacting
partner co-retention. The duplication process is depicted with the 
associated conditional probabilities for each type of duplication based 
upon a sampling of gene families that includes Lepisosteus oculatus.

Extended Data Figure 4 | Tissue gene expression regulation.
a, Hierarchical clustering of tissue gene expression in adult salmon from
fresh water. WT = expression data from normal diploid Atlantic salmon.
Sally = expression data from the double haploid fish used for reference genome
sequencing. b, Classification of 11 co-expression clusters. Gene expression data are
from 15 tissues from a diploid adult Atlantic salmon from fresh water.
Co-expression clusters are either associated with expression patterns from
a single tissue or multiple tissues with similar physiological functions.
Co-expression clusters A-K are named after the tissue(s) that
contribute the most to their characteristic expression regulation profile: skin;
skin and muscle; nose and gill; kidney; gut and pyloric ceca; heart and liver;
unspecific; brain; eye; testis and ovary; testis. c, Gene expression correlation
between salmon Ss4R homeologues and Northern pike orthologues. P = pike,
S1 = salmon homeologue with lowest tissue expression correlation with
pike, S2 = salmon homeologue with highest tissue expression correlation
with pike. d, Tissue expression specificity. Tissue expression specificity of Ss4R
homeologues with novel gene regulation (S1) and conserved gene regulation
(S2) compared to pike. Gene co-expression clusters are denoted A-K (see
description in figure legend for b). Significantly different tissue specificity
between diverged (S1) and conserved (S2) homeologues is indicated with
a P value in the figure. e, Relationship between CDS-length difference and
Ss4R expression regulation divergence. CDS length divergence is calculated
as a fraction of the longest CDS in each Ss4R pair. Red colour represents
homeologue pairs that are in different co-expression clusters (see above
sections a and b for details). f, Illustration of sub- and neofunctionalization
as defined by the analyses of ‘on’ and ‘off’ expression patterns. Red colour
indicates a gene being ‘on’ in one tissue compared to its Ss4R duplicate and the
assumed ancestral state of the diploid pike outgroup.

Sex-specific pruning of neuronal
synapses in Caenorhabditis elegans


Meital Oren-Suissa, Emily A. Bayer & Oliver Hobert


Whether and how neurons that are present in both sexes of the same species can differentiate in a sexually dimorphic 
manner is not well understood. A comparison of the connectomes of the Caenorhabditis elegans hermaphrodite and male 
nervous systems reveals the existence of sexually dimorphic synaptic connections between neurons present in both sexes. 
Here we demonstrate sex-specific functions of these sex-shared neurons and show that many neurons initially form 
synapses in a hybrid manner in both the male and hermaphrodite pattern before sexual maturation. Sex-specific synapse 
pruning then results in the sex-specific maintenance of subsets of these connections. Reversal of the sexual identity of 
either the pre- or postsynaptic neuron alone transforms the patterns of synaptic connectivity to that of the opposite 
sex. A dimorphically expressed and phylogenetically conserved transcription factor is both necessary and sufficient to 
determine sex-specific connectivity patterns. Our studies reveal new insights into sex-specific circuit development. 


Like other invertebrate or vertebrate nervous systems, the nervous 
system of C. elegans contains a number of sex-specific neurons, most 
of which are generated during the process of sexual maturation in late 
larval stages. Apart from these sex-specific neurons (8 in hermaphrodites,
91 in males), there are 294 neurons shared by both sexes,
most of which are embryonically generated. The hermaphrodite and
male versions of these shared neurons display the same lineage history,
the same cell body position, share molecular features (for example,
neurotransmitter identity) and display similar neurite projection
patterns. Intriguingly, the recent reconstruction of the posterior
nervous system of the C. elegans adult male and its comparison to
the connectome of the hermaphrodite (a derived female) show that
several of the sex-shared neurons are strongly sexually dimorphic in 
their synaptic wiring patterns. These anatomical observations provide 
a fascinating opportunity to study how seemingly similar sex-shared 
neurons develop sexually dimorphic characteristics. 


Synaptic target choices between sex-shared neurons 
Here we focus on a group of sex-shared, but dimorphically
connected, sensory, inter- and motorneurons (Fig. 1a). The sexually
dimorphic connectivity differences do not simply reflect sex-specific 
modifications of similar neuronal circuits, but rather sex-shared neu- 
rons wire into completely distinct circuits (Fig. 1a). For example, 
the PHB phasmid sensory neuron synapses onto three different sex- 
shared command interneurons only in hermaphrodites (AVA, AVD 
and PVC). In males, PHB connects to a sex-shared interneuron, AVG, 
which in turn connects to downstream tail motor neurons only in 
males (Fig. 1a). To examine whether dimorphic connections are due 
to dimorphic synaptic partner choice or a reflection of sex-specific 
neuron or process placement, we analysed serial electron micrographs 
and found that the phasmid neuron processes are directly adjacent 
to the AVG process in both sexes (Extended Data Fig. 1). Dimorphic 
connections between these neurons are therefore a consequence of 
sex-specific synaptic partner choice. 

Although electron microscopy provides a powerful tool to identify 
synaptic partners, the presently available electron microscopy analysis 
relies on one or two animals at a single stage (adult). To confirm the 
electron microscopy results, and to visualize the reproducibility as 


well as the developmental aspects of dimorphic synapses, we used 
two distinct trans-synaptic labelling techniques (Fig. 1b), ‘GRASP’
(GFP reconstitution across synaptic partners) and ‘iBLINC’ (in vivo
biotin labelling of intercellular contact). We generated transgenic
lines in which seven distinct synaptically connected neuron pairs are 
labelled with GRASP and/or iBLINC. Using cytosolic mCherry to label 
neurites, we examined synaptic puncta along these mCherry-labelled 
adjacent processes. For all seven synaptic connections examined, we 
reproducibly identified discrete synaptic puncta that appear in the sex- 
ually dimorphic manner predicted by the electron microscopy analysis 
(Fig. 1a, b, Extended Data Figs 1-3).


Sexually dimorphic functions of sex-shared neurons 

To assess whether sexually dimorphic wiring is an indication of 
dimorphic neuronal function, we either surgically removed individual 
dimorphically connected neurons or genetically silenced them using 
ectopic expression of a histamine-gated chloride channel. Silencing
of the PHB neurons (using a driver that is strongly and consistently 
expressed only in PHB; Extended Data Fig. 4) affected forward loco- 
motion of hermaphrodites, but not of males (Extended Data Fig. 5a). 
Another previously described PHB function also displays notable 
sex-specificity. Specifically, it has been shown that the PHB sensory 
neuron modulates chemorepulsive behaviour of hermaphrodites in 
response to the noxious chemical sodium dodecyl sulfate (SDS). 
The modulatory effect of PHB is observed upon functionally disabling
subsets of head sensory neurons. Such head-neuron-defective
animals fail to avoid SDS because PHB provides an antagonistic input
to command interneurons. This antagonistic input to the command
circuit is revealed through ablation of PHB, which restores the ability
of head-sensory-neuron-defective animals to avoid SDS. We
corroborated this antagonistic input through silencing of PHB with
a histamine-gated chloride channel, by examining ceh-14 mutant
animals, in which all three phasmid neurons fail to adopt their
glutamatergic identity, and by silencing both ASH and PHB neurons
(Fig. 2a, Extended Data Fig. 5d, e). Importantly, male PHB does not 
connect to command interneurons and no antagonistic input to the 
command circuit is expected. One would therefore predict that the 
disabling of head sensory-neuron function in males (ASH silenced and

Figure 1 | Visualizing sexually dimorphic synapses. a, Connectivity
of selected neurons at the adult stage, as inferred from serial section
reconstructions of electron micrographs. Chemical synapses between
sensory (triangles), inter- (hexagons) and motor (circles) neurons 

are depicted as arrows. Thickness of arrows correlates with degree of 
connectivity (number of sections over which en passant synapses are 
observed). The inset indicates where synaptic connections are formed. 
Black/grey, shared-sex neurons; white, male-specific neurons. Arrows 
indicate hermaphrodite-specific (red) and male-specific (blue) chemical 
synapse between shared neurons. b, Visualizing sexually dimorphic 
synapses. Fluorescent micrographs of GRASP GFP signal in preanal 


ablated; or tax-4 mutants) should not have the impact on SDS avoid- 
ance that is observed in hermaphrodites. We indeed found this to be 
the case (Fig. 2a, Extended Data Fig. 5). 

As adult male PHB neurons are not involved in the avoidance of 
noxious chemicals, we next investigated the function they adopt in 
males. We noted that the male-specific innervation target of PHB, 
the interneuron AVG, becomes innervated by male-specific hook 
sensory neurons (HOA and HOB), known to be involved in sensing a 
hermaphrodite-derived signal that induces males to stop at the vulva 
during mating behaviour. Moreover, PHB also innervates HOA
directly. Through genetic silencing, genetic ablation and microsurgical
laser ablations, we found that PHB and AVG are involved in
responding to this vulva stop signal (Fig. 2b, c). Ablation of VD13,
one of the motor neurons innervated by AVG specifically in males,
also caused vulva location defects (Fig. 2c), as did ablation of the LUA
neuron (Extended Data Fig. 6). PHB also innervates the male-specific
PVX and PVY neurons, involved in mate-contact-induced backward
locomotion in males, and silencing PHB also affected this sexually
dimorphic behaviour (Fig. 2d). We conclude that PHB has sexually 
dimorphic functions, modulating locomotion and processing repul- 
sive environmental sensory information in hermaphrodites, while 


ganglion region outlined in the inset in Fig. 1a. Neuronal processes are
labelled with cytoplasmic codon-optimized Cherry markers. GRASP
data are shown; additional iBLINC data are shown in Extended Data
Fig. 3. Expression pattern of the promoters used in this study to drive cell-specific
expression can be found in Extended Data Fig. 4. Quantification
of data is shown in Fig. 3b; the number of fluorescent puncta (outlined
with white boxes) is similar to those observed in the electron microscopy
analysis. Gut, auto-fluorescent gut granules. Scale bars, 10 µm. In all
images anterior is left, and dorsal is up. Blue and red indicate male and
hermaphrodite, respectively, in all figures.


sensing hermaphrodite-derived mating cues in males (summarized 
in Extended Data Fig. 6c). 

Sex-shared interneurons also display sexually dimorphic functions. 
We found that the dimorphically connected LUA neuron was also 
involved in the response to the vulva stop signal in males, and automated
worm-tracking analysis revealed that ablation of LUAs results in
pausing defects in hermaphrodites but not in males (Extended Data 
Fig. 6). 


Sexually dimorphic synapse patterning 

We next asked how sexually dimorphic connectivity patterns are established.
With the exception of the VD13 neuron (which is born at the
end of the first larval stage), the shared neurons that we analysed are
born and project their neurites during embryogenesis. Sex-specific neurons
in the tail are born in the last larval stage and male-specific behaviours
emerge soon after. One could therefore envision that dimorphic
connections between the embryonically born neurons could form dur- 
ing sexual maturation, that is, long after the respective neurons and 
axonal patterns have been established in the animal (‘late maturation’ 
model; Fig. 3a). Alternatively, a ‘default connectivity’ could be established
by both sexes early in development, followed by a sex-specific

Figure 2 | Functional repurposing of dimorphic neurons.
a, Chemosensory repulsion assays (see Methods for full description of
behavioural assays). Scatter diagrams plotting the avoidance index of
single animals. Each black dot represents the fraction of reversal responses
(scored as reversing or not reversing) in 10 or more assays of a single
animal. Magenta vertical bars represent the median. The left column
indicates predictions of reversal behaviour, based on previously published
data that demonstrated a strong reversal drive from head neurons (thick
arrows) that is counteracted by a forward drive mediated by the PHB
neurons in the tail. Control experiments for silencing using histamine
and additional SDS assays can be found in Extended Data Fig. 5.
sra-6p::HisCl1 and gpa-6p::HisCl1 were used for ASH and PHB silencing,
respectively. Summary of dimorphic behaviours induced by PHB sensory
neurons can be found in Extended Data Fig. 6c. b, Changes in male
movement and posture triggered by mate contact. c, Mutant or laser-operated
animals tested for the male vulva location efficiency. In lin-11
and ceh-14 mutants, the AVG and phasmid neurons, respectively, fail to
differentiate. Error bars, s.e.m. d, Initiation of backward movement
in response to hermaphrodite contact is dependent on PHB activity,
measured as contact response efficiency. Each dot represents one animal.
We performed the nonparametric Mann-Whitney test (Wilcoxon rank
sum test) with Bonferroni correction for multiple comparisons (a, c)
and Fisher’s exact test (d). ****P < 0.0001, ***P < 0.001, **P < 0.01,
*P < 0.05; NS, not significant.


synapse rewiring upon sexual maturation (‘rewiring model; Fig. 3a). 
By analysing connectivity throughout larval development using the 
GRASP and iBLINC systems, we discovered two different mechanisms: 
a ‘pruning’ mechanism, used by many different neurons, and a ‘prepat- 
terning’ mechanism used by others (Fig. 3a). 

For example, PHB neurons connect to AVA and AVG in her- 
maphrodites and males at early larval stages (Fig. 3b, c). Therefore, 
male-specific pruning of the PHB-AVA synapses results in an 
adult male-specific PHB-AVG connection, and conversely, 
hermaphrodite-specific pruning of the PHB-AVG synapses results in 
the hermaphrodite-specific PHB-AVA synapses. We observed these 
synaptic pruning events to occur during sexual maturation in the 
fourth larval stage (L4) (Extended Data Fig. 7). 

As the PHB-AVA connection exists in both sexes at the L1 stage, 
we asked whether at this stage both hermaphrodites and males dis- 
play a PHB-dependent modulation of the repulsive response to nox- 
ious chemicals. We indeed found this to be the case (Fig. 2a). The 
repulsive response was restored in both sexes upon silencing of PHB 
(Extended Data Fig. 5f). These observations demonstrate that PHB 
does not merely acquire dimorphic functions in the adult, but that 
PHB neurons in males undergo a repurposing of function, from 
initially being involved in sensing noxious environmental cues to 
processing sex-specific cues. 

Sexually dimorphic pruning of synaptic connections can also 
be observed in a number of additional neuronal contexts (Fig. 3, 
Extended Data Fig. 2). For example, PHA connects to AVG in both 
sexes at the L1 stage, but this connection is selectively lost in males 
(Fig. 3b, c). Similarly, the AVG interneuron connects to the choliner- 
gic DA9 motor neuron in both sexes at the L1 stage, but the synaptic 
connection persists only in males, not hermaphrodites (Figs 1b, 3b, c). 

One other sexually dimorphic synaptic connection arises by a 
fundamentally different principle: the AVG to VD13 connection is 
only ever observed in males, and never in hermaphrodites (Figs 1b 
and 3b, c). Some prepatterning is also already evident in the pruned 
AVG-DA9 synapses; these are present in both sexes in the L1 stage, 
but are stronger in the male (Fig. 3b, c). Analysis of ~1,000 serial 
electron microscopy sections shows that the axons of the AVG and 
VD13 (synaptically connected in a male-specific manner) are more 
adjacent in males compared to hermaphrodites (Extended Data 
Fig. 1). Even though we cannot exclude the possibility that dimorphic 
adjacency is merely a secondary consequence of failure to establish 
synaptic contact, we propose that the prepatterning of dimorphic 
synapse may be a consequence of dimorphic axon placement. Taken 
together, we identified two types of synaptic maturation events: 
pruning events that coincide with sexual maturation and dimorphic 
prepatterning events that precede sexual maturation (summarized 
in Fig. 3d). 


The sex of neurons controls synaptic patterning 

We next investigated whether sexually dimorphic wiring patterns are 
determined by the sex of both the pre- and postsynaptic cell, or by 
non-cell-autonomous processes. We addressed this question by gen- 
erating sexually mosaic animals through cell-type specific, ectopic 
expression of the fem-3 gene, which downregulates the global hermaph- 
roditic identity determinant TRA-1 (a Gli-type zinc finger transcription 
factor) thereby imposing a male identity on the specific cell-type in an 
otherwise hermaphroditic animal®*!*'°. Conversely, ectopic cell-type- 
specific expression of the intracellular domain of the TRA-2 receptor 
(‘TRA-2ic’) feminizes cells via stabilization of the TRA-1 transcription 
factor in otherwise male animals!””°. We found that identity trans- 
formations of single neurons transformed synaptic wiring patterns to 
that of the opposite sex (Fig. 4, Extended Data Fig. 8). For example, 
masculinization of PHB results in a loss of the PHB-AVA connection 
in an otherwise hermaphroditic animal and a gain of the PHB-AVG 
connection, demonstrating that the sex of PHB dictates which con- 
nection is pruned or maintained. Non-dimorphic PHB connectivity 
at the L1 stage is not affected by sex-reversal (Extended Data Fig. 8). 
Notably, masculinization of PHB also restores the behavioural defects 
of head-sensory-neuron-disabled hermaphrodites (Fig. 2a). Thus, the 
behavioural differences between males and hermaphrodites in the 
noxious chemical response can be linked specifically to the sex of an 
individual neuron (Fig. 2). 

Not only is the sex of the presynaptic neuron a determinant of 
sex-specific wiring patterns, but the postsynaptic cell is also impor- 
tant for establishing proper connectivity (Fig. 4). Masculinization of 
AVG in hermaphrodites (in which PHB does not normally connect 
to AVG) maintains the PHB-AVG connection. Similarly, masculinization 
of AVA disrupts maintenance of the PHB-AVA synapses in 
hermaphrodites (Fig. 4). 

Feminizing neurons cell-autonomously via expression of TRA-2ic in 
otherwise male animals generally produced the converse effects (Fig. 4). 
For example, feminization of the PHB neurons in male animals is 
sufficient to maintain the normally hermaphrodite-specific PHB-AVA 
synapse and prune the normally male-specific PHB-AVG synapses 
(Fig. 4). These ectopic PHB-AVA synapses seem to be functional, as 
tax-4 males with feminized PHBs reverse less and can antagonize the 
SDS response (Fig. 2a). 

We further probed the non-autonomous nature of maintaining 
sex-specific synapses by asking whether the maintenance of the PHB- 
AVG synapses in hermaphrodites, through masculinization of AVG, 
affects the maintenance of the hermaphrodite-specific PHB-AVA 
synapse. Indeed, in animals in which fem-3 is driven in AVG (resulting 
in stabilization of the PHB-AVG synapse), PHB-AVA synapse number 
is significantly reduced. Conversely, in hermaphrodites in which we 
masculinized AVA, the PHB-AVA synapses are not only pruned, but 
the PHB-AVG synapses are stabilized (Fig. 4). These results suggest a 
competition mechanism in which one synaptic wiring configuration 
is maintained at the expense of the ‘alternative’ wiring pattern. 

Figure 3 | Synaptic patterning during development. a, Models of how 
sexually dimorphic connectivity patterns may arise during development. 
b, c, Quantification and fluorescent micrographs of synaptic pruning 
measured by the number of synaptic puncta observed using GRASP GFP 
(PHB-AVA, PHB-AVG, AVG-VD13, AVG-DA9) and iBLINC GFP (PHA-AVG) 
in L1 and adult hermaphrodites and in males. As VD neurons are born at 
the end of the L1 stage, juvenile VD13 puncta were quantified at the L2 
stage. We performed nonparametric Mann-Whitney test (Wilcoxon rank sum 
test) with Bonferroni correction for multiple comparisons. 
****P < 0.0001, ***P < 0.001, **P < 0.01, *P < 0.05; NS, not significant. 
Magenta horizontal bars represent the median. 
… number from L1 to L4 in the hermaphroditic PHB-AVA connection and the 
male PHB-AVG is in line with an overall increase in total synapse 
numbers seen between neurons between L1 and adult stage as deduced by 
recent reconstruction of an L1 stage animal (M. Zhen, personal 
communication). M, merge; N, neurite; P, puncta. 


Doublesex-like transcription factors control dimorphic 
connectivity 

We next sought to determine the link between the globally acting sex 
determination system, mediated by TRA-1, and synaptic pruning. 
A recently framed hypothesis posits that globally acting hormonal 
signals in vertebrates operate through region-specific modular effec- 
tor system”’. In C. elegans, the global TRA-1 regulator may oper- 
ate in a cell-type-specific manner through one of the 11 members 
of the DMD (Doublesex/MAB-3 domain) family of transcription 
factors, which are conserved regulators of sexual identity in various 
organisms”. Three of these family members (mab-3, mab-23, dmd-3) 
have previously been implicated in somatic sex differences®, but the 
other eight have remained uncharacterized. We found that dmd-5 and 
dmd-11 are dimorphically expressed in the dimorphically connected 
AVG neuron described above; expression is observed in male AVG, 
but not in hermaphrodite AVG (Fig. 5a). This dimorphic expression 
is controlled cell-autonomously via the canonical sex-determination 
pathway and TRA-1, as expression of FEM-3, the negative regulator 
of hermaphroditic TRA-1 protein, in the AVG neurons derepresses 
dmd-5 and dmd-11 expression in hermaphrodites (Fig. 5a, Extended 
Data Fig. 9a). 

Figure 4 | Autonomy and non-autonomy of sex-specific synapse 
pruning. Either the presynaptic cell (PHB) or the postsynaptic cells 
(AVA, AVG) were masculinized (by expression of FEM-3) or feminized 
(by expression of TRA-2ic), and the number of synaptic puncta was 
quantified in hermaphrodites and males. We performed nonparametric 
Mann-Whitney test (Wilcoxon rank sum test) with Bonferroni correction 
for multiple comparisons. ****P < 0.0001, ***P < 0.001, **P < 0.01. 
Magenta horizontal bars represent the median. Results are summarized 
in Extended Data Fig. 8a. Note that a general trend in the sex-reversal 
experiments is that the change of the sex of the presynaptic neuron 
(PHB) appears to have a stronger effect than changing the sex of either 
of the postsynaptic cells. Changing the sex of both postsynaptic cells 
simultaneously did not enhance the effects (Extended Data Fig. 8c). 

Figure 5 | Sexually dimorphic expression and function of dmd-5 and dmd-11. 
a, Sex-specific expression of dmd-5 and dmd-11 reporter genes in AVG. 
There are no additional dimorphisms in the retrovesicular ganglion; apparent 
differences are due to differences in z-planes incorporated into the final 
Z-stack projection. Masculinization of AVG derepresses dmd-5 and dmd-11 
expression in hermaphrodite AVG. Quantified in Extended Data Fig. 9a. 
b, Vulva location efficiency is affected in dmd-5 (n = 11), dmd-11 (n = 20) and 
dmd-5; dmd-11 double mutant males (n = 29) compared with wild-type males 
(n = 15). Expression of either dmd-5 (n = 10) or dmd-11 (n = 10) in AVG 
(using the inx-18 promoter) of double mutant males rescues behaviour defects. 
Error bars, s.e.m. c, d, dmd-5 and dmd-11 are required for maintenance of 
AVG synapses. Fluorescent micrographs (c) and quantification of synaptic 
puncta of PHB-AVG (d). Region of neurite overlap and observed synaptic 
puncta marked with white boxes. Quantification of DMD effect on LUA-AVG 
synapses can be found in Extended Data Fig. 9i. We performed nonparametric 
Mann-Whitney test (Wilcoxon rank sum test) with Bonferroni correction 
for multiple comparisons (b, d). ****P < 0.0001, ***P < 0.001, **P < 0.01, 
*P < 0.05. Magenta horizontal bars represent the median. Scale bars, 10 μm. 

dmd-5 and dmd-11 single mutants show defects in the male-specific 
function of AVG in mating behaviour (Fig. 5b, Extended Data 
Fig. 9f-h) and dmd-5; dmd-11 double mutant animals show even 
stronger defects (Fig. 5b). These defects can be rescued by 
AVG-specific expression of either dmd-5 or dmd-11 (Fig. 5b). Moreover, 
dmd-5 single mutants and dmd-5; dmd-11 double mutants display the 
alterations in AVG synaptic wiring that one would expect from factors 
that control the sexually dimorphic nature of AVG wiring (Fig. 5c, d, 
Extended Data Fig. 9). Male-specific PHB-AVG synapses fail to be 
maintained in dmd-5; dmd-11 mutant males. Synaptic defects can be 
rescued through AVG-specific expression of dmd-5 (Fig. 5c, d). There 
are no synaptic defects observed in the L1 stage of dmd-5; dmd-11 
mutants when no synaptic dimorphism is yet apparent (Extended 
Data Fig. 9b), indicating that dmd-5 and dmd-11 are not required for 
synapse formation per se, but are specifically involved in controlling 
sex-specific synapse maintenance by preventing synaptic pruning, as 
predicted by their expression pattern. Furthermore, the PHB-AVA 
hermaphroditic connection is non-autonomously stabilized in dmd-5; 
dmd-11 mutant males, supporting the competition model discussed 
above (Extended Data Fig. 9e). 

dmd-5 is not only required but also sufficient to prevent synaptic 
pruning, as deduced by ectopic dmd-5 expression in the AVG neu- 
rons of hermaphrodites. In these animals the PHB-AVG synapses are 
maintained (Fig. 5d). As DMD proteins are generally thought to work 
as repressors~’, we propose that dmd-5, in conjunction with dmd-11, 
represses the expression of gene(s) in male AVG neurons that are 
involved in the pruning of AVG connections to PHB and that repres- 
sion of these pruning factor(s) in hermaphroditic AVG, via ectopic 
dmd-5 expression, inhibits pruning (summarized in Extended 
Data Fig. 9j). Notably, although AVG-masculinized dmd-5(+) 
hermaphrodites (masculinized through fem-3-driven degradation 
of TRA-1) do maintain PHB-AVG synapses, AVG-masculinized 
dmd-5(—) hermaphrodites do not (Fig. 5d, Extended Data Fig. 9d). 
This demonstrates that dmd-5 functions downstream of tra-1, as 
already suggested by the observation of ectopic dmd-5 expression in 
animals in which we degraded TRA-1 cell-autonomously in AVG (see 
above; Fig. 5a, Extended Data Fig. 9a). 

In conclusion, our studies show how sex-shared neurons adopt 
sex-specific synaptic wiring patterns. Sex-specific wiring patterns arise 
in a neuron-type specific manner. In most cases, we observe sexual 
maturation-coupled sex-specific pruning of synaptic connections that 
were indiscriminately generated in both sexes. In at least one case, 
sex-specific synapses are prepatterned before overt sexual maturation 
of the animal. We observed notable patterns of cellular autonomy of 
the synaptic pruning events and we found pruning to be regulated by 
sex-specifically expressed transcription factors. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not blinded to 
allocation during outcome assessment. 

C. elegans strains. Wild-type strains were C. elegans variety Bristol, strain N2. 
Worms were maintained according to standard methods™. Worms were grown 
at 20°C on nematode growth media (NGM) plates seeded with bacteria (E. coli 
OP50) as a food source. Mutant strains used in this study include: CB1489 
him-8(e1489) IV, CB4088 him-5(e1490) V, MT633 lin-11(n389) I; him-5(e1467) V, 
RB1295 F10C1.5(ok1394)/+ II, FX01760 dmd-5(tm1760)/+ II, PR678 tax-4(p678) 
III, DA509 unc-31(e928) IV, TB528 ceh-14(ch3) X, VC30074 dmd-5(gk408945) II, 
VC1193 dmd-11(gk552) V. 

All transgenic strains used in this study are listed in Supplementary Table 1, 
ordered by figures and extended data figures. 

Cloning and constructs. To generate inx-18p::wCherry (pMO10), inx-18 second 
intron was amplified from pOH260 (ref. 25) and restriction sites were added to ends 
(5’-SphI, 3’-PstI). This fragment was digested and ligated into pPD95.75 vector in 
which the GFP was replaced with codon-optimized mCherry (‘worm Cherry’). To 
restrict expression to AVG, AIY motif (5’-ATTAGTTTCGTTAA-3’) was deleted 
from the 2nd intron of inx-18 using site directed mutagenesis (Forward primer: 
5’-AATTTTTTTTCATGTTACCTACTTATTTTTTTTCTATTAGCGTCATAG 
AT-3’, Reverse: 5’-ATCTATGACGCTAATAGAAAAAAAATAAGTAGGTAACATG 
AAAAAAAATT-3’). inx-18 2nd intron is also dimly and variably expressed in URX. 

fem-3::SL2 was amplified from P(rab-3)::fem-3::SL2::mCherry Gateway 
construct!’ using PCR and KpnI restriction site was added to 5’ end. This PCR was 
digested and inserted into KpnI; AscI digested pPD95.75 with wCherry, to generate 
fem-3::SL2::wCherry (pMO11). 

To generate inx-18p::fem-3::SL2::wCherry (pMO12), pMO10 was digested using 
SphI, PstI and ligated into digested pMO11. 

To generate inx-18p::nlg-1::gfp1-10 (pMO17), inx-18p::nlg-1::gfp11 (pMO18), 
inx-18p::BirA::nrx-1 (pMO19) and inx-18p::AP::nlg-1 (pMO24), inx-18p was 
cloned from pMO10 (inx-18p::wCherry) into the SphI, XmaI sites of MVC2 
(pSM::nlg-1::GFP1-10), MVC3 (pSM::nlg-1::gfp11) (a gift from M. VanHoven), 
gcy-8p::BirA::nrx-1 and ttx-3p::AP::nlg-1 (a gift from H. Bulow), respectively. 

eat-4p9 (‘LUA promoter’, a fragment of 172 bp upstream of the eat-4 gene) 
was amplified using PCR from eat-4p2 (ref. 11) and restriction sites were added 
to ends (5’-HindIII, 3’-BamHI). This fragment was digested and ligated into 
pPD95.75 vector, to generate eat-4p9::gfp (pMO13). eat-4p9 is expressed in the 
tail neurons LUAs and PVR. 

To generate eat-4p9::nlg-1::gfp1-10 (pMO14), eat-4p9 was amplified from 
eat-4p2 and restriction sites were added to ends (5’-XbaI, 3’-NheI). This fragment 
was digested and ligated into MVC2. 

To generate eat-4p9::fem-3::SL2::wCherry (pMO15), pMO14 was digested using 
SpeI, NheI and the fragment containing eat-4p9 was ligated into pMO11. 

eat-4p9::wCherry was generated by PCR fusion of eat-4p9 and wCherry”®. 

To generate eat-4p9::BirA::nrx-1 (pMO16), eat-4p9 was amplified from eat-4p2 
and restriction sites were added (5’-XmaI, 3’-SphI). This fragment was digested 
and ligated into digested gcy-8p::BirA::nrx-1. 

To generate gpa-6p::nlg-1::gfp11 (pMO21), gpa-6p::BirA::nrx-1 (pMO22) and 
gpa-6p::fem-3::SL2::wCherry (pMO29), a 2.6 kb gpa-6 promoter was cloned from 
MVC6 (pSM::gpa-6p::nlg-1::gfp1-10, a gift from M. VanHoven) into the SphI, 
SmaI sites of MVC3 (pSM::nlg-1::GFP11), gcy-8p::BirA::nrx-1 and pMO11, 
respectively. To generate gpa-6p::gfp, the 2.6 kb gpa-6 fragment was cloned into the SphI, 
SmaI sites of pPD95.75. gpa-6 is expressed brightly and consistently in PHBs and 
also dimly and variably in AWAs. 

To generate flp-18p::fem-3::SL2::2XNLS::TagRFP (pMO30), FEM-3 was cloned 
into an SL2::2XNLS::TagRFP plasmid using restriction-free (RF) cloning’’. The 
3.1 kb flp-18 PCR product was cloned into the SphI, XmaI sites of 
fem-3::SL2::2XNLS::TagRFP. To generate flp-18p::gfp, the 3.1 kb flp-18 PCR product was cloned 
into the SphI, XmaI sites of pPD95.75. flp-18 is expressed brightly and consistently 
in AVAs, and dimly and variably in AIYs. 

To generate srg-13p::BirA::nrx-1 (pMO23), a 3 kb promoter fragment of 
srg-13 was amplified from genomic DNA (primer F: 
5’-GTACCTGCAGAAGGACTTGGCAGAAAGAAGC-3’, R: 
5’-TGCACCCGGGGTGGGCTGTAATTTTTAGCTCG-3’), with PstI, XmaI sites added to 
ends. This was cloned into the PstI, XmaI sites of pPD95.75. A 2.2 kb srg-13 
promoter was then cloned into the SphI, XmaI sites of gcy-8p::BirA::nrx-1. 

To generate acr-2p::AP::nlg-1 (pMO25), acr-2p was cloned from pEVL194 
(acr-2p::nlg-1::gfp11, a gift from B. D. Ackley) into the SphI, XmaI sites of 
ttx-3p::AP::nlg-1. 

To generate the cell-specific feminizing constructs, 
tra-2(ic)::SL2::2XNLS::TagRFP (pMO37) was made by inserting tra-2(ic) from 
pENTRY 1-2 tra-2(ic) (a gift from D. Portman) into an SL2::2XNLS::TagRFP 
construct using RF cloning. inx-18p::tra-2(ic)::SL2::2XNLS::TagRFP (pMO32) 
was generated by subcloning inx-18p from pMO10 (inx-18p::wCherry) into the 
SphI, XmaI sites of pMO37. gpa-6p::tra-2(ic)::SL2::2XNLS::TagRFP (pMO33) was 
generated by subcloning gpa-6 promoter from MVC6 into the SphI, SmaI sites of 
pMO37. flp-18p::tra-2(ic)::SL2::2XNLS::TagRFP (pMO34) was generated by 
subcloning 3.1 kb flp-18 PCR product from MVC12 into the SphI, XmaI sites of 
pMO37. eat-4p9::tra-2(ic)::SL2::2XNLS::TagRFP was generated by ligating the 
digested eat-4p9 mentioned above into the SphI, XmaI sites of pMO37. 

To generate PHB histamine-induced silencing construct, a 2.6 kb gpa-6 pro- 
moter from MVC6 was cloned into pNP403 (tag-168p::HisCl1::SL2::gfp; a gift from 
N. Pokala and C. Bargmann) to replace the tag-168 promoter using RF cloning. 

To generate dmd-5 and dmd-11 genomic rescue constructs, dmd-5 and 
dmd-11 genomic sequences including 500 bp 3’-UTR were amplified from 
genomic DNA and cloned into pMO10 to replace wCherry using RF cloning. The 
resulting constructs are pMO31, inx-18p::dmd-5_genomic+3'-UTR and pMO38, 
inx-18p::dmd-11_genomic+3'-UTR. 

Microscopy. Worms were anaesthetized using 100 mM of sodium azide (NaN3) 
and mounted on 5% agar on glass slides. Worms were analysed by Nomarski 
optics and fluorescence microscopy, using a Zeiss 780 confocal laser-scanning 
microscope. When using GFP, we estimated the resolution of our confocal to be 
~250 nm. Multidimensional data were reconstructed as maximum intensity projections 
using Zeiss Zen software. Puncta were quantified by scanning the original full 
Z-stack for distinct dots in the area where the processes of the two neurons overlap. 
Figures were prepared using Adobe Photoshop CS6 and Adobe Illustrator CS6. 
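
For readers unfamiliar with the term, a maximum intensity projection collapses a Z-stack into a single image by keeping, for every pixel, the brightest value found along the z axis. The sketch below is only an illustration in plain Python; the projections in this study were made with Zeiss Zen software, the function name and the nested-list layout `stack[z][y][x]` are assumptions, and puncta were counted on the full Z-stack rather than on the projection.

```python
from typing import List, Sequence

Plane = Sequence[Sequence[float]]  # one optical section, indexed [y][x]


def max_intensity_projection(stack: Sequence[Plane]) -> List[List[float]]:
    """Return a 2D image holding the per-pixel maximum across all z-planes.

    Assumes every plane in the stack has identical height and width.
    """
    if not stack:
        raise ValueError("the Z-stack must contain at least one plane")
    height = len(stack[0])
    width = len(stack[0][0])
    return [
        [max(plane[y][x] for plane in stack) for x in range(width)]
        for y in range(height)
    ]


# Hypothetical two-plane stack of 2 x 2 pixels.
example_stack = [
    [[1, 5], [0, 2]],
    [[3, 4], [7, 1]],
]
projection = max_intensity_projection(example_stack)
```
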
Cell ablation. We performed laser ablations using a MicroPoint Laser System 
Basic Unit (N2 pulsed laser (dye pump), ANDOR Technology) attached to a Zeiss 
Axioplan 2IE widefield microscope (objective EC Plan-Neofluar 100×/1.30 Oil 
M27). This laser delivers 120 μJoules of 337 nm energy with a 3-nsec pulse length. 
Ablations were performed as previously described”, with pulse repetition rates of 
~15 Hz. Cell identification was performed with GFP or Cherry markers. Ablations 
were performed at the L4 stage, and worms were analysed 24-48 h later. Mock 
animals were placed on same slide under microscope but were not ablated, and 
were allowed to recover in a similar manner. After relevant assays were performed 
(tracking or mating assays), worms were mounted again on glass slides and 
analysed under microscope to validate that cell-ablation was successful. 

Mating behaviour assays. Mating assays were based on procedures described 
previously!”°. Males were picked at an early L4 stage and kept apart from her- 
maphrodites for 24h. One male was transferred to a plate covered with a thin 
fresh OP50 lawn containing 10-15 adult unc-31(e928) hermaphrodites. These her- 
maphrodites move very little, allowing for an easy recording of male behaviour. 
Hermaphrodites were also isolated from opposite sex at the L4 stage and used 
24h later. Animals were monitored and sequence of events was recorded within 
a 15 min window or until the male ejaculated, whichever occurred first. Males 
were tested for their ability to locate vulva in a mating assay, calculated as location 
efficiency”. The number of passes or hesitations at the vulva until the male first stops 
at the vulva were counted: location efficiency = 1/number of encounters to stop. 
PHB-silenced males were digitally recorded using the Exo Labs model 1 camera 
mounted on a Nikon Eclipse E400 compound microscope with a long-distance ×20 
lens. These videos were analysed for vulva location efficiency and percentage 
of successful contact responses, which requires tail apposition and initiation of 
backward locomotion. Percentage response to contact = 100 × (the number of 
times a male exhibited contact response/the number of times the male makes 
contact with a hermaphrodite via the rays). 
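
The two mating metrics defined above reduce to simple ratios. A minimal Python sketch follows, assuming that the encounter count includes the encounter at which the male first stops at the vulva; the function and parameter names are illustrative and do not come from the original analysis.

```python
def location_efficiency(encounters_to_stop: int) -> float:
    """Vulva location efficiency = 1 / number of encounters to stop.

    Assumption: the count includes the encounter at which the male first
    stops, so a male that stops on its first pass scores 1.0.
    """
    if encounters_to_stop < 1:
        raise ValueError("at least one vulva encounter is required")
    return 1.0 / encounters_to_stop


def percent_contact_response(responses: int, contacts: int) -> float:
    """Percentage response to contact = 100 x (responses / contacts)."""
    if contacts < 1:
        raise ValueError("at least one contact is required")
    if not 0 <= responses <= contacts:
        raise ValueError("responses must lie between 0 and contacts")
    return 100.0 * responses / contacts


# Hypothetical male: stops at the vulva on the fourth encounter and
# responds to three of four contacts with a hermaphrodite.
efficiency = location_efficiency(4)
response = percent_contact_response(3, 4)
```
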

SDS-avoidance behaviour. SDS-avoidance assay was based on procedures 
described previously!*. A small drop of solution containing either the repellent 
(0.1% SDS in M13 buffer) or buffer (M13 buffer: 30 mM Tris-HCl (pH 7.0), 
100mM NaCl, 10mM KCl) is delivered near the tail of an animal while it moves 
forward. Once in contact with the tail, the drop surrounds the entire animal by 
capillary action and reaches the anterior amphid sensory organs. The drop was 
delivered using 10 μl glass calibrated pipettes (VWR International) pulled by hand 
on a flame to reduce the diameter of the tip. The capillary pipette was mounted in 
a holder with rubber tubing and operated by mouth. Assayed worms were trans- 
ferred to fresh non-wet unseeded NGM plates and allowed to rest at room temper- 
ature for a few minutes. Each assay started with testing the animals with drops of 
buffer alone. The response to each drop was scored as reversing or not reversing. 
The avoidance index is the number of reversal responses divided by the total 
number of trials. An inter-stimuli interval of at least two minutes was used between 
successive drops to the same animal. Each animal was tested at least 10 times. 
The L1 animals were scored blindly and sex was noted 48 h later. 
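
As an illustration of the scoring described above, the avoidance index of one animal and the median across animals (the magenta bars in the figures) could be computed as follows. This is a sketch under stated assumptions: each trial is recorded as a boolean, the 10-trial minimum is enforced through a hypothetical `min_trials` parameter, and none of the names come from the original analysis.

```python
from statistics import median
from typing import Sequence


def avoidance_index(reversals: Sequence[bool], min_trials: int = 10) -> float:
    """Number of reversal responses divided by the total number of trials.

    `reversals` holds one entry per drop: True if the animal reversed.
    """
    if len(reversals) < min_trials:
        raise ValueError(f"each animal needs at least {min_trials} trials")
    return sum(reversals) / len(reversals)


# Hypothetical data for three animals, 10 trials each.
animal_scores = [
    avoidance_index([True] * 7 + [False] * 3),
    avoidance_index([True] * 9 + [False] * 1),
    avoidance_index([True] * 2 + [False] * 8),
]
group_median = median(animal_scores)
```
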

Neuronal silencing using the histamine chloride channel 1 (HisCl1) system. 
ASH::HisCl1 (kyEx5104), PHBp::HisCl1 (otEx6341), PHBp::HisCl1; tax-4(p678) 
and ASH::HisCl1; PHBp::HisCl1 transgenic animals were picked at the L4 stage 
and placed on NGM plates containing 10 mM histamine with OP50 bacteria as 
food source. As a control, animals were placed on NGM plates containing OP50 
bacteria but no histamine. Chemorepulsion behaviour assays were performed 
24h later. Histamine plates were prepared as previously described*". As additional 
controls, panneuronal HisCl1 transgenic worms (CX14373 kyEx4571 (pNP403 
(tag-168::HisCl1::SL2::gfp), myo-3::mCherry); a gift from C. Bargmann) were placed 
on histamine plates prior to use. Histamine plates on which tag-168::HisCl1 animals 
were paralysed within a minute were used for behavioural assays. There was no 
difference in the avoidance index of wild-type worms grown on histamine plates 
and on plates without histamine. 
Automatic worm tracking. Hermaphrodite and male transgenic otIs462 animals 
were ablated at the L4 stage and left to recover for 24h before tracking. PHB- 
silenced hermaphrodites and males were transferred into NGM plates containing 
10 mM histamine for 24h before tracking. At the adult stage, animals were placed 
on an NGM plate seeded with 20 μl of diluted OP50 bacteria in the centre. As a 
control, mock ablated transgenic otIs462 animals were used. Automatic tracking was 
performed at ~22°C (room temperature) with Worm Tracker 2.0 (WT2), which 
uses a mobile camera to track and record individual worms and 5 min videos were 
generated. Analysis of the tracking videos was performed as previously described’®. 
To assess the effect of LUA ablation, we first conducted a pilot tracking study 
of LUA-ablated hermaphrodite worms versus controls and LUA-ablated males 
versus controls. Owing to the extensive analysis of the tracking system, 702 features 
were initially measured. After correction for multiple testing, pausing and pausing- 
related features emerged as the main, significant difference between LUA-ablated 
males and hermaphrodites. Therefore, to obviate further heavy corrections for 
multiple testing, we chose to re-run these experiments, this time only measuring 
pausing, permitting us to use the uncorrected P value. Within this subset of 
new experiments, wherein only one feature was measured (that is, pausing), we 
found highly significant P values. Given the necessity of four tests to make the 
proof, we chose the most conservative correction possible for multiple testing, 
Bonferroni, to illustrate that the P values measured maintain significance even 
after such extreme correction. 
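
The Bonferroni step described above multiplies each P value by the number of tests and caps the result at 1. A minimal sketch for a family of four tests is given below; the P values are invented for illustration and are not data from this study, and the function name is an assumption.

```python
from typing import List, Sequence


def bonferroni(p_values: Sequence[float]) -> List[float]:
    """Bonferroni-adjusted P values: p * m, capped at 1, for m tests."""
    m = len(p_values)
    if m == 0:
        raise ValueError("at least one P value is required")
    return [min(1.0, p * m) for p in p_values]


# Hypothetical raw P values from four tests (not data from the paper).
raw = [0.01, 0.2, 0.5, 0.001]
adjusted = bonferroni(raw)

# A test remains significant if its adjusted P value is below alpha.
alpha = 0.05
significant = [p < alpha for p in adjusted]
```

Comparing the adjusted values with alpha is equivalent to comparing the raw values with alpha divided by the number of tests.
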
Analysis of electron microscopy data. Original transmission electron microscopy 
serial electron micrographs from ‘JSE’ hermaphrodite tail series and ‘N2Y’ male 
tail series were analysed using Elegance software*. 
Images were aligned and screened for adjacency of neuronal processes of the 
pairs of cells described in this manuscript. For JSE, images from ‘PAG’ and ‘red 
series’ were analysed, and for N2Y, ‘PAG’ images were analysed. 
Number of sections in which processes are adjacent 

Synapse     Sex-specific connection   Hermaphrodite   Male 
PHB>AVG     M                         123             206 
AVG>VD13    M                         14              193 
PHA>AVG     H                         198             285 


Extended Data Figure 1 | Adjacency of neuronal processes in 
hermaphrodites and males. Four transmission electron microscopy 
prints from wild-type adult hermaphrodite ‘JSE’* and four from adult 
male ‘N2Y’, showing adjacency of neuronal processes. These images were 
collected at MRC/LMB for ref. 8 and ref. 2, and the annotated images 
are available online at http://www.wormimage.org, courtesy of D. Hall. 
The set of processes directly adjacent to one another has been defined 
as the ‘neighbourhood’ of that process**, and the placement of processes 
into specific neighbourhoods is a major determinant of connectivity. 
Connections form only in one sex, although processes are adjacent in 
both sexes. a, Print 385, JSE series (JSE_122283; 
http://wormimage.org/image.php?id=122283&page=2) and print 620, N2Y series (PAG620; 
http://wormimage.org/image.php?id=103528&page=18) shows 
PHB-AVG adjacent processes, pseudo labelled in green (AVG) and red 
(PHB). b, Print 359, JSE series (JSE_122257; 
http://wormimage.org/image.php?id=122257&page=2) and print 500, N2Y series (PAG500; 
http://wormimage.org/image.php?id=103408&page=20) shows AVG-VD13 
adjacent processes, pseudo labelled in green (AVG) and pink (VD13). 
c, Print 377, JSE series (JSE_122275; 
http://wormimage.org/image.php?id=122275&page=2) and print 800, N2Y series (PAG800; 
http://wormimage.org/image.php?id=103706&page=16) shows PHA-AVG 
adjacent processes, pseudo labelled in green (AVG) and orange (PHA). 
d, A table summarizing the number of electron microscopy sections in 
which direct adjacency of processes was observed. Over 1,000 PAG 
(preanal ganglion) serial sections were analysed for each sex. 
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white = male-specific neurons 


a red: hermaphrodite-specific chemical synapse between shared neurons blue: male specific chemical synapse between shared neurons 


black/grey = sex-shared neurons 
Extended Data Figure 2 | Dimorphic connections of LUA. a, The
connectivity diagram shown in Fig. 1a, including dimorphic connections
of the LUA and PHC neurons. The GRASP data that we show in this
figure, as well as the pruning data, sexual reversal data and mutant data
shown in Extended Data Figs 6, 8 and 9, support the original LUA connectivity
data reported in ref. 5 and summarized in this schematic. However, a
recent reassessment of the tracing of electron micrographs suggests that
the connectivity assignments of the PHC and LUA neurons may have
been swapped with each other (S. Emmons, personal communication).
b, Overview of LUA synaptic connections labelled in this paper.
c, Visualizing LUA sexually dimorphic synapses. Quantification and
fluorescent micrographs of GRASP trans-synaptically labelled puncta
between LUA-AVG and LUA-AVA, in L1 and adult hermaphrodites
and in males. M, merge; N, neurite; P, puncta. Region of neurite overlap
and observed synaptic puncta are marked with white boxes. Gut,
auto-fluorescence gut granules. Scale bars, 10 μm (adult) and 5 μm (L1).
d, Fluorescent micrographs of the preanal ganglion region of transgenic
animals expressing the presynaptic BirA::nrx-1 fusion in LUA (using the
eat-4p9 promoter), and postsynaptic acceptor peptide::nlg-1 fusion in
AVG. For more details see Extended Data Fig. 3. Scale bars, 10 μm. We
performed the nonparametric Mann-Whitney test (Wilcoxon rank sum test)
with Bonferroni correction for multiple comparisons. ****P < 0.0001,
***P < 0.001, **P < 0.01; NS, not significant. Magenta horizontal bars
represent the median.
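
The Bonferroni adjustment named in this legend can be illustrated with a minimal sketch, assuming the raw P values have already been obtained from the Mann-Whitney test. The function names and the example P values are hypothetical, and the single-asterisk threshold (P < 0.05) is borrowed from other legends in this paper rather than from this one.

```python
def bonferroni(p_values):
    """Bonferroni adjustment: multiply each raw P value by the number
    of comparisons and cap the result at 1."""
    m = len(p_values)
    return [min(1.0, p * m) for p in p_values]


def significance_label(p_adjusted):
    """Map an adjusted P value to the asterisk notation used in the legends.
    The '*' (P < 0.05) level is an assumption taken from other legends."""
    for threshold, label in ((0.0001, "****"), (0.001, "***"),
                             (0.01, "**"), (0.05, "*")):
        if p_adjusted < threshold:
            return label
    return "NS"


# Hypothetical raw P values from three pairwise comparisons.
raw = [0.00002, 0.004, 0.2]
adjusted = bonferroni(raw)
labels = [significance_label(p) for p in adjusted]
```

With three comparisons, a raw P value of 0.004 becomes 0.012 after adjustment, so it drops from the two-asterisk to the one-asterisk level.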
Figure panel labels (neurons and transgene pairs): AVG, VD13, inx-18p::nlg-1::spGFP1-10 with unc-25p::nlg-1::spGFP11; inx-18p::nlg-1::spGFP1-10 with acr-2p::nlg-1::spGFP11; DA9, inx-18p::BirA::nrx-1 with acr-2p::AP::nlg-1; AVG, gpa-6p::nlg-1::spGFP1-10 with inx-18p::nlg-1::spGFP11.
Extended Data Figure 3 | Trans-synaptic labelling of dimorphic
connections using iBLINC. a, Overview of synaptic connections labelled
in this paper (Fig. 1 and this figure). Some connections were labelled with
both GRASP and iBLINC, yielding similar results. We generally note that
the number of synapses is roughly reproducible from animal to animal and
the number of the fluorescent dots is roughly comparable to the number
of synapses identified by the electron microscopy analysis. However,
there is also some variance from animal to animal (quantified in Fig. 3),
consistent with previous analysis. b, Labelling data not shown in Fig. 1.
Fluorescent micrographs of the preanal ganglion region of transgenic
animals expressing the presynaptic BirA::nrx-1 fusion in PHB (using the
gpa-6 promoter), AVG (using the inx-18 promoter) and PHA (using the
srg-13 promoter), and postsynaptic acceptor peptide::nlg-1 fusion in AVG
and DA9 (using the acr-2 promoter). Transgenic worms also express the
streptavidin detector fused to 2× sfGFP from the coelomocytes (unc-122
promoter). Neuronal processes are labelled with cytoplasmic Cherry
markers of the iBLINC pairs. Region of neurite overlap and observed
synaptic puncta are marked with white boxes. Scale bars, 10 μm. Anterior
is left and dorsal is up.
Extended Data Figure 4 | Specificity of driver lines. a, A 2.6 kb gpa-6
promoter fragment fused to GFP is expressed consistently in PHB at all
stages. There is also faint and variable expression in AWA. b, The 3.1 kb
flp-18 promoter fused to GFP is expressed consistently in AVA, and dimly
and variably in AIY (85% of animals) and RIM (15% of animals), which
were identified based on comparison to published flp-18 expression
patterns. c, The 1.8 kb inx-18 2nd intron fused to codon-optimized
Cherry is expressed brightly and consistently in AVG, and dimly and
variably in URXs. The AIY motif present in this fragment was deleted
(see Methods). d, The 170 bp eat-4 promoter (eat-4p9) fused to GFP is
expressed in LUAs and PVR.
Extended Data Figure 5 | Additional SDS avoidance assays. a, PHB-silenced
hermaphrodites move forward more slowly (quantified as absolute
mid-body speed) and as a result cover less of the plate (quantified as forward
path range) compared to control hermaphrodites, whereas PHB-silenced
males do not show any difference compared to control males. In addition,
the tail-bending wave is affected in PHB-silenced hermaphrodites, but not
in males (quantified as tail crawling frequency). Statistics were computed
using the Wilcoxon rank-sum test, and correction for multiple testing (q values)
was computed across all measures (approximately 1404 tests) using the
Benjamini-Hochberg procedure. b, Silencing of ASH neuronal activity
using the histamine chloride channel 1 (HisCl1) affects the animals’
chemosensory avoidance response. Males and hermaphrodites were assayed
for effects of histamine on SDS avoidance behaviour. We used the him-5
mutant background (which gives a high incidence of male progeny) as
wild type. There is no difference between worms assayed in the presence
and absence of histamine (Fig. 2a). The avoidance index of single animals
was calculated as the fraction of reversal responses in 10 or more assays,
depicted as black dots. Magenta vertical bars represent the median.
L4 animals carrying the kyEx5104 (pNP424 (sra-6::HisCl1::SL2::mCherry))
transgene were grown on 10 mM histamine-containing NGM plates for
24 h. As a control, kyEx5104 animals were grown on NGM plates without
histamine. ASH silencing reduces the head sensory response to SDS; thus in
hermaphrodites the antagonizing activity of the PHBs inhibits the backward
movement and the worms do not reverse. In males, no such antagonizing
activity occurs and the worms reverse, albeit with reduced ability. c, Ablation
of ASH neurons affects the animals’ chemosensory avoidance response in
a similar manner to histamine-induced silencing. sra-6::gfp was used to
identify the ASH neurons. d, Behavioural differences stem from dimorphic
connectivity differences and not from amphid/phasmid sensory function.
tax-4; ceh-14 double mutants behave in a similar manner in both sexes.
PHB silencing (gpa-6p::HisCl1::SL2::gfp) does not affect behaviour in either
sex. Silencing both the ASHs and PHBs in males showed no difference
compared to ASH-silenced males. However, silencing ASHs and PHBs in
hermaphrodites showed a significant difference compared to ASH-silenced
hermaphrodites, where we expect the PHBs to function in an antagonistic
manner; thus in their absence, ASH-silenced hermaphrodites showed an
increased ability to respond to SDS by reversing. e, PHB silencing in tax-4
mutant background. tax-4 is a subunit of a cyclic nucleotide gated channel
expressed in chemosensory and thermosensory neurons (see g). tax-4
animals show a strongly reduced avoidance response to SDS. Silencing
of PHBs in tax-4 hermaphrodites eliminated the antagonizing effect and
animals were able to avoid SDS by backing. f, PHB silencing in tax-4 mutant
background at the L1 stage. Lack of avoidance seen in tax-4 L1 males and
hermaphrodites depends on PHB function. For all panels, we performed the
nonparametric Mann-Whitney test with Bonferroni correction for multiple
comparisons. ****P < 0.0001, **P < 0.01, *P < 0.05; NS, not significant.
g, tax-4 expression pattern is identical in hermaphrodites and males.
kyEx744 (tax-4p::TAX-4::gfp) was analysed in adult males and
hermaphrodites. Amphid neurons in the head (ADL, ASH, ASI, ASJ, ASK,
AWB) were stained using DiD to facilitate cell identification. The neurons
identified, shown in the ‘Merge’ panel, are identical in both sexes and
match published data. All neurons are bilaterally symmetric left-right pairs,
and for simplicity only left cells are shown. Scale bars, 10 μm.
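
The avoidance index defined in this legend (the fraction of reversal responses in 10 or more assays per animal) can be written as a short sketch. The function name, the boolean encoding of a reversal, and the example animals are hypothetical and are not taken from the paper.

```python
from statistics import median


def avoidance_index(responses, min_assays=10):
    """Fraction of assays in which a single animal reversed.

    `responses` holds one truthy/falsy value per assay (True = reversal).
    The legend requires 10 or more assays per animal.
    """
    if len(responses) < min_assays:
        raise ValueError("at least %d assays are required" % min_assays)
    return sum(1 for r in responses if r) / len(responses)


# Hypothetical animals, each assayed at least 10 times.
animals = [
    [True] * 8 + [False] * 2,   # 8 reversals in 10 assays
    [True] * 5 + [False] * 5,   # 5 reversals in 10 assays
    [True] * 9 + [False] * 3,   # 9 reversals in 12 assays
]
indices = [avoidance_index(a) for a in animals]
group_median = median(indices)  # the value drawn as the magenta bar
```

Each index corresponds to one black dot in the plots, and the group median to the magenta bar.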
reported connectivity data shown in Extended Data Fig. 2. We performed

LUA and AVA in hermaphrodites. Feminizing LUA, AVA and AVG
by expression of TRA-2ic was sufficient to induce ectopic LUA-AVA
puncta in males. We performed the nonparametric Mann-Whitney test
with Bonferroni correction for multiple comparisons. ****P < 0.0001,
*P < 0.01; NS, not significant. Magenta horizontal bars represent the
median.
Extended Data Figure 9 | dmd-5 and dmd-11 expression, sequence and
function. a, Quantification of dimorphic expression of dmd-5 and
dmd-11 in AVG. Expression in hermaphrodites was off or extremely
faint. Expression of inx-18p::FEM-3 derepressed dmd-5 and dmd-11
gene expression in hermaphrodite AVGs. Statistics were calculated using
Fisher's exact test. b, Quantification of the number of PHB-AVG
synaptic puncta in L1 dmd-5(gk408945); dmd-11(gk552) double
mutants, compared with wild-type L1 animals. At the L1 stage,
dmd-5 and dmd-11 do not affect PHB-AVG synapses, suggesting they
are required for maintenance of mature synapses. c, dmd-5 single
mutants and dmd-5; dmd-11 double mutants display similar alterations
in AVG synaptic wiring. d, dmd-5 mutation suppresses the ectopic
PHB-AVG synapses in AVG-masculinized animals. e, The PHB-AVA
connection is non-autonomously partially stabilized in dmd-5; dmd-11
mutants. f, dmd-11 genomic locus and gk552 deletion location.

g, dmd-5 genomic locus and mutation description. The ok1394 location
was not curated; to determine its location we used the following primers:
forward, 5′-CAGAATGCCTGTTTCTCCGTC-3′; reverse,
5′-CACTGCTTTTCCCGTTCAAAC-3′. ok1394 and tm1760 were both
found to have an embryonic lethal phenotype that could not be rescued
by the genomic locus (data not shown); thus we searched for single-point
mutations from the ‘million mutation project’. gk408945 is a missense
substitution mutation of W54 to R, located in the second exon. Genomic
analysis revealed that this mutation lies within the conserved DM
domain (h), with perfect conservation across evolution. h, DM-domain
sequence conservation and location of gk408945 mutation. Conservation
and multiple sequence alignment were performed using UCSC Genome
Browser (http://genome.ucsc.edu) and ClustalW. The DM domain is an
intertwined zinc-containing DNA binding module. The DM domain
binds DNA as a dimer, allowing the recognition of pseudopalindromic
sequences. i, dmd-5 and dmd-11 are required for maintenance of
AVG synapses. Fluorescent micrographs and quantification of synaptic
puncta of LUA-AVG. Region of neurite overlap and observed synaptic
puncta are marked with white boxes. M, merge; N, neurite; P, puncta.
Statistics were calculated using the nonparametric Mann-Whitney
test (b, d, e, i) or Kruskal-Wallis test with Dunn’s multiple comparison
test (c). ****P < 0.0001, ***P < 0.001, *P < 0.05; NS, not significant.
Magenta horizontal bars represent the median. When using a parametric
t-test, there is also a significant difference for the LUA-AVG synapse
between dmd-5;dmd-11 mutant hermaphrodites and dmd-5;dmd-11
mutant hermaphrodites that overexpress DMD-5 (*P < 0.05).

j, Summary of data. TRA-1 and DMD proteins are commonly thought to
work as transcriptional repressors. As dmd-5 and dmd-11 are already
dimorphically expressed in AVG in embryos and L1 stage animals (not
shown in this schematic), there must be other timer mechanisms that
control the onset of pruning. For example, DMD-5 and DMD-11 may
work together with a regulatory factor of the stage-specifically acting
heterochronic pathway. Furthermore, we hypothesize that other neurons,
such as the AVA neuron, may have their own complement of sex-specific
dmd genes that control pruning.
Interconnected microbiomes and
resistomes in low-income human habitats


Erica C. Pehrsson, Pablo Tsukayama, Sanket Patel, Melissa Mejia-Bautista, Giordano Sosa-Soto, Karla M. Navarrete,
Maritza Calderon, Lilia Cabrera, William Hoyos-Arango, M. Teresita Bertoli, Douglas E. Berg, Robert H. Gilman &
Gautam Dantas


Antibiotic-resistant infections annually claim hundreds of thousands of lives worldwide. This problem is exacerbated by 
exchange of resistance genes between pathogens and benign microbes from diverse habitats. Mapping resistance gene 
dissemination between humans and their environment is a public health priority. Here we characterized the bacterial 
community structure and resistance exchange networks of hundreds of interconnected human faecal and environmental 
samples from two low-income Latin American communities. We found that resistomes across habitats are generally 
structured by bacterial phylogeny along ecological gradients, but identified key resistance genes that cross habitat 
boundaries and determined their association with mobile genetic elements. We also assessed the effectiveness of widely 
used excreta management strategies in reducing faecal bacteria and resistance genes in these settings representative of 
low- and middle-income countries. Our results lay the foundation for quantitative risk assessment and surveillance of 
resistance gene dissemination across interconnected habitats in settings representing over two-thirds of the world’s
population.


Antibiotic resistance in bacterial pathogens causes hundreds of thousands 
of annual fatalities globally. The spread of resistant organisms and their
antibiotic resistance genes occurs by direct contact between humans and
via interactions with environmental microbiota. Horizontal gene transfer
(HGT) facilitates this dissemination, and the reservoir of antibiotic
resistance genes (the ‘resistome’) in the environment, from which pathogens
could theoretically draw, is ancient, diverse, and widespread.
Characterizing resistome distributions and their potential for dissemi- 
nation across diverse habitats can identify the microbiota and antibiotic 
resistance genes that pose the highest risks to human health. 

Most resistome studies have focused on either industrialized
or remote, ‘pristine’ settings. However, most of the world’s people
reside outside these extremes: ~5.8 billion live in low- and middle-
income countries, with 863 million people living in slums. Here,
we characterized the microbiomes and resistomes of human faecal 
and co-localized, ecologically diverse environmental microbiota from 
two low-income, resource-limited Latin American settings: (1) a rural 
village of subsistence farmers in El Salvador (RES) and (2) a peri-urban 
shanty-town (slum) in Lima, Peru (PST), which represent critically 
understudied microbial ecosystems (Extended Data Fig. 1a). Aspects 
of life in such settings that are distinct from both hunter-gatherer and 
industrialized populations include crowding (in PST), limited access 
to clean drinking water and sanitation, supplementation of personally 
grown produce and livestock with processed foods, and ready access 
to antibiotics without prescriptions. Furthermore, such industrializing
countries are responsible for the majority of the worldwide 36%
increase in antibiotic use between 2000 and 2010 (ref. 18), making 
investigation of antibiotic resistance transfer in these settings a global 
public health priority. 

We analysed 263 faecal samples from 115 individuals in 27 houses 
over two years from RES and PST, as well as 209 environmental samples 
from donor households and surrounding areas in these communities. 


The environmental samples included faeces from domesticated 
animals, soil, water, and samples from the sanitation facilities of each 
community: composting latrines in RES and a district-wide sewage 
system with a modern wastewater treatment plant (WWTP) in PST. 
We used a combination of 16S sequencing, high-throughput
functional metagenomic selections, and whole-metagenome
shotgun sequencing to compare the phylogenetic architectures of
these microbial populations and their associated resistomes. 


Resistome correlates with phylogeny across habitats 
Small-insert metagenomic expression libraries in Escherichia coli 
constructed from 51 human faecal and 45 environmental samples 
from RES and PST (representing 258 Gb) were selected for functional 
resistance against 17 antibiotics (Supplementary Table 1). Sequencing 
and annotation of these selections identified 1,100 unique (100%
amino acid identical) encoded antibiotic resistance proteins collec- 
tively conferring resistance against all antibiotics except meropenem 
(see Methods; Supplementary Table 2). A total of 121 of these proteins 
were novel (<60% amino acid identity to any protein in NCBI nr), the 
majority of which (72%) were predicted antibiotic modifiers, including 
57 class A β-lactamases. RES latrine libraries yielded the most novel
proteins (46%), proportionally more than expected (Pearson's chi-squared
test, P<2 x 10~°; Extended Data Fig. 1c).
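
The novelty criterion above (<60% amino acid identity to any protein in NCBI nr) can be illustrated with a minimal sketch. It assumes two sequences that have already been aligned to equal length with `-` as the gap character, and it counts identity over gap-free columns only. Both the identity convention and the function names are assumptions made for illustration; the actual comparison would rely on a database search rather than this toy calculation.

```python
NOVELTY_THRESHOLD = 60.0  # per cent amino acid identity, as stated in the text


def percent_identity(aligned_a, aligned_b, gap="-"):
    """Per cent identity between two pre-aligned sequences.

    Assumption: columns containing a gap in either sequence are ignored,
    and identity is matches divided by the remaining columns.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if x != gap and y != gap]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def is_novel(best_identity):
    """A protein counts as novel if its best database hit is below 60%."""
    return best_identity < NOVELTY_THRESHOLD


# Hypothetical toy alignment: 4 identical residues in 5 gap-free columns.
identity = percent_identity("MKT-AL", "MKSWAL")
```

In this toy case the identity is 80%, so the protein would not be called novel.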

To further characterize RES and PST resistome diversity and abun- 
dance, we performed whole-metagenome shotgun sequencing on 191 
human faecal and 94 environmental samples (representing 344 Gb; 
Extended Data Fig. 1b; see Methods). We used ShortBRED to quantify
translated antibiotic resistance gene abundance in all sequenced
metagenomes using a custom antibiotic resistance database that 
included antibiotic resistance genes identified here (see Methods; 
Supplementary Tables 3-5). RES and PST human-associated and 
environmental resistomes were related along an ecological gradient 
in terms of input from human faeces (Bray-Curtis; Extended Data
Fig. 2a), with habitat explaining 22.4% of resistome variation (adonis,
P < 0.001, Bray-Curtis).
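
The Bray-Curtis dissimilarity used to compare resistomes can be sketched for two abundance profiles as the sum of absolute differences divided by the total abundance. The function name and example vectors are hypothetical, and the adonis (permutational) test applied to the resulting distance matrix is not reproduced here.

```python
def bray_curtis(a, b):
    """Bray-Curtis dissimilarity between two non-negative abundance vectors.

    0 means identical profiles; 1 means no shared features.
    """
    if len(a) != len(b):
        raise ValueError("abundance vectors must have equal length")
    total = sum(a) + sum(b)
    if total == 0:
        return 0.0  # two empty profiles are treated as identical
    return sum(abs(x - y) for x, y in zip(a, b)) / total


# Hypothetical abundances of three resistance genes in two samples.
faecal = [10, 0, 5]
latrine = [5, 5, 5]
d = bray_curtis(faecal, latrine)  # |10-5| + |0-5| + |5-5| = 10, over 30
```

Pairwise values of this kind fill the distance matrix that ordination and adonis then operate on.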

A similar ecological gradient by habitat was observed when consider- 
ing phylogenetic composition, based on 16S sequencing of 228 human 
faecal and 203 environmental samples from RES and PST (Extended 
Data Fig. 1b, see Methods), with habitat explaining even more of the 
variation between samples (41.9%; adonis, P < 0.001, weighted UniFrac; 
Extended Data Fig. 2b). Procrustes analysis confirmed that antibiotic 
resistance is significantly correlated with community composition 
(Bray-Curtis, M² = 0.360, P < 0.001; Extended Data Fig. 2c), not
randomly distributed across habitats.


RES/PST versus global human faecal microbiota 

The human gut microbiota is highly diverse globally, with phylogenetic 
and functional variation potentially driven by age, diet, cultural tradi- 
tions, pathogen carriage, and periodic perturbation (for example, by 
antibiotic exposure). We compared the RES and PST human faecal
microbiota to published microbiota data sets from the USA, Malawi, 
Venezuela, and Peru (Amazonian hunter-gatherers and Andean 
highlands agriculturalists) (see Methods), classifying PST as
‘peri-urban industrializing’ and RES as ‘rural agriculturalist’. Faecal
microbiota composition clustered by host lifestyle, despite differences
in geographic origin and study: RES microbiota clustered with
other rural agriculturalists and hunter-gatherers, while peri-urban PST 
microbiota were intermediate between these and the urban industrialized
cohorts (adonis R² = 23.8%, P < 0.001, weighted UniFrac; Fig. 1a
and Extended Data Fig. 3). 

To evaluate whether antibiotic resistance burden correlated with 
industrialization, we compared resistomes from RES and PST to
industrialized USA, traditional hunter-gatherer, and rural agriculturalist
communities from ref. 25, two of which are also Peruvian (Extended
Data Fig. 4a-f; see Methods). PST had the greatest number of antibiotic 
resistance proteins per person, despite not being the most industrialized 
cohort analysed (Fig. 1b and Extended Data Fig. 4g), consistent with 
Chinese and Hadza hunter-gatherer faecal microbiota harbouring at 
least as much or more antibiotic resistance as Western industrialized 
microbiota.


RES human and environmental microbiota 

Frequent contact with environmental reservoirs during subsistence 
farming and inadequate excreta management likely promote antibiotic
resistance exchange in rural settings. The RES community had
only one source of filtered water, and contents of composting latrines
were used to fertilize household agricultural plots after attempted 
sterilization, potentially recycling antibiotic resistance. Accordingly, 
we compared the microbiomes and resistomes of RES human faecal 
samples and their surroundings, including latrines, animal faeces, soils, 
and drinking water sources (see Methods). 

The RES human faecal microbiota separated from soil and water 
along PC1 in a principal coordinate analysis (PCoA) visualization 
of their phylogenetic composition, and habitats differed at the phy- 
lum level (Fig. 2a, Extended Data Fig. 5a and Supplementary Tables 
6-10). Latrines were equidistant to human faecal microbiota and soil 
(weighted UniFrac, non-parametric Student's t-tests with Bonferroni 
correction) and were enriched for Halomonadaceae, a family of hal- 
ophilic organisms potentially selected by the alkaline latrine environ- 
ment (Extended Data Fig. 5b, Supplementary Discussion). Cow and 
dog faecal microbiota, which are ecologically similar to human faecal 
microbiota but with greater environmental exposure, were also inter- 
mediate along PC1. Soil and water were closest to each other (P< 0.05, 
non-parametric Student's t-tests with Bonferroni correction) and varied 
along PC3 (6.3% of variation). Human faecal microbiota were more 
homogenous in phylogenetic composition than latrine, soil, and water 
microbiota, potentially because environmental samples encountered 
more diverse and variable conditions. 
Figure 1 | RES and PST human faecal microbiota and resistomes versus
global populations. a, PCoA of weighted UniFrac distances between
RES (n = 60) and PST (n = 46) microbiota and published human faecal
microbiota (n = 446; see Supplementary Table 14), coloured by host
lifestyle. Adonis R² = 15.4%, P < 0.001. b, Number of antibiotic resistance
proteins per person in RES (n = 42) and PST (n = 44) and published human
faecal microbiota (ref. 25) (n = 53; see Supplementary Table 15), coloured by
host lifestyle. Error bars, s.d.; centre bars, median. *P < 0.05, non-parametric
Student’s t-tests, Bonferroni correction. AR, antibiotic resistance.


The RES resistomes exhibited a similar ecological gradient (Bray- 
Curtis, Fig. 2b). Although RES soil had the highest phylogenetic 
diversity (Fig. 2c), it contained fewer antibiotic resistance proteins 
per sample than all habitats but water (Fig. 2d). In contrast, human 
faecal microbiota had the lowest phylogenetic diversity, but more 
antibiotic resistance proteins per sample than both soil and water. 
However, soil and latrine resistomes were more heterogeneous than 
human and animal faecal resistomes. In particular, non-human RES 
resistomes were enriched in aminoglycoside nucleotidyltransferases, 
class D β-lactamases, SMR transporters, and resistant dihydropteroate
synthetases and dihydrofolate reductases (Fig. 2b, Extended Data 
Fig. 5c, d; see Methods). 

By analysing RES habitats prone to exchange with human faeces 
using SourceTracker, we found that direct input from chicken
faeces represents a potent avenue for the introduction of antibi- 
otic resistance genes compatible with the human faecal micro- 
biota into soil (see Methods). The contribution of RES human 
faecal resistomes to environmental resistomes (Fig. 2f) roughly 
recapitulated phylogenetic trends, with significantly higher input to 
latrines than soil or water (Fig. 2e; see Supplementary Information). 
Only the two soils collected adjacent to chicken coops had >1% 
human faecal resistome input. These two samples clustered 
away from the other samples along PC3 (5.3% of variation) and 
were closer to human samples along PC1 in the PCoA visualization of 
all RES samples (Fig. 2a). Overall, RES human faecal microbiota were 
closer to soil collected near chicken coops than to any other soil location 
in both phylogenetic and antibiotic resistance composition (Bray- 
Curtis, non-parametric Student's t-tests with Bonferroni correction). 
They shared 80 antibiotic resistance proteins at >99% identity, including 
three class C β-lactamases, which are common in Enterobacteriaceae,
in contrast to two antibiotic resistance proteins shared between RES 
humans and soil from the pond edge. On average, human faecal and 
chicken coop soil resistomes shared 10 antibiotic resistance proteins, 
significantly more than with any other soil type (1-2 antibiotic resistance 
proteins, non-parametric Student’s t-tests with Bonferroni correction). 


PST human and sewage microbiota 

To monitor the impact of sewage treatment on resistomes, we sam- 
pled wastewater at street-level access points nearest to participating 
PST households and influent and effluent from the local WWTP 
and compared their phylogenetic composition and resistome to the 
AR annotations
1. ABC transporter
2. Aminoglycoside nucleotidyltransferase
3. Class D β-lactamase
4. MATE transporter
5. Novel ciprofloxacin AR gene
6. Resistant DHFR
7. Resistant DHPS
8. Ribosomal protection
9. RND transporter
10. SMR transporter
Figure 2 | Salvadoran rural agriculturalist (RES) human faecal
and environmental microbiota and resistomes. a, c, e, Phylogenetic
composition (n = 60, 6, 36, 84 and 22 for human, animal, latrine, soil and
water, respectively). b, d, f, Resistome (n = 42, 4, 16, 30 and 4 for human,
animal, latrine, soil and water, respectively). a, PCoA of weighted UniFrac
distances between microbiota. Adonis R² = 43.2%, P < 0.001. b, PCoA
of Bray-Curtis distances between resistomes with abundance-weighted
coordinates of the top five most discriminative antibiotic resistance
categories enriched in human or non-human habitats (squares, size
proportional to overall abundance). Adonis R² = 26.6%, P < 0.001.
c, d, *P < 0.05, non-parametric Student’s t-tests, Bonferroni correction.
c, Faith's phylogenetic diversity. d, Observed antibiotic resistance proteins.
e, f, Percentage of latrine, soil, and water microbiota (e) and resistomes (f)
attributable to human faeces, as determined by SourceTracker. *P < 0.05,
pairwise Wilcox tests, Bonferroni correction. Error bars, s.d.; centre
bars, median.


faecal microbiota of PST residents (see Methods and Supplementary 
Information). Portions of treated wastewater effluent are discharged 
into the Pacific Ocean and also used to irrigate public parks and agricul- 
tural fields, potentially enabling re-introduction of antibiotic residues 
and antibiotic resistance genes into human communities.

Although geographically closest to human donors, street- 
access sewage was more similar in microbial composition to WWTP 
influent (non-parametric Student’s t-tests with Bonferroni correction, 
P<0.001), implying that even relatively brief periods in this non-human, 
aerobic environment caused a greater shift in bacterial composition 
than all changes downstream during transit through the sewage system. 
Overall, PST human waste underwent drastic changes in microbial 
composition as it progressed through sewage treatment, decreasing 
in similarity to human faecal microbiota at each subsequent stage 
(Student’s non-parametric t-tests with Bonferroni correction, P< 0.001, 
weighted UniFrac; Fig. 3a, Extended Data Fig. 6a, b and Supplementary 
Table 11). 

Sewage resistomes also decreased in similarity to PST human faecal 
resistomes at each treatment stage, although street-access sewage and 
WWTP influent were equally similar to human faeces in antibiotic 
resistance content (non-parametric Student's t-tests with Bonferroni 
Figure 3 | Peruvian peri-urban slum (PST) human faecal and sewage
microbiota and resistomes. a, c, e, Phylogenetic composition (n = 45, 16,
14 and 13 for human, street-access, influent and effluent, respectively).
b, d, f, Resistome (n = 44, 14, 13 and 7 for human, street-access, influent
and effluent, respectively). a, PCoA of weighted UniFrac distances between
microbiota. Adonis R² = 58.0%, P < 0.001. b, PCoA of Bray-Curtis
distances between resistomes with abundance-weighted coordinates of the
top five most discriminative antibiotic resistance categories enriched in
human or sewage habitats (squares, size proportional to overall abundance).
Adonis R² = 32.3%, P < 0.001. c, d, *P < 0.05, non-parametric Student’s
t-tests, Bonferroni correction. c, Faith’s phylogenetic diversity. d, Observed
antibiotic resistance proteins. e, f, Percentage of sewage microbiota (e) and
resistomes (f) attributable to human faeces at each sewage treatment stage, as
determined by SourceTracker. *P < 0.05, pairwise Wilcox tests, Bonferroni
correction. Error bars, s.d.; centre bars, median.


correction, Bray-Curtis; Fig. 3b). Although WWTP effluent had the
highest phylogenetic diversity (Fig. 3c), it had the fewest antibiotic 
resistance proteins per sample (Fig. 3d), as observed for soil in RES. 
In contrast, street-access sewage and WWTP influent had both higher 
phylogenetic diversity and more antibiotic resistance proteins per sample 
than PST human faeces. Drug efflux antibiotic resistance mechanisms 
were higher overall in PST human faecal vs sewage resistomes (P < 0.05, 
pairwise Wilcox tests with Bonferroni correction; Fig. 3b and Extended 
Data Fig. 6c, d). In contrast, sewage was enriched for aminoglycoside 
acetyltransferases, class D β-lactamases, and resistant dihydrofolate
reductases. 

As expected, the human faecal contribution to WWTP effluent 
phylogenetic composition was lower than for street-access sewage 
or WWTP influent (P < 0.05, pairwise Wilcox tests with Bonferroni 
correction; Fig. 3e). Interestingly, this difference was not significant 
for resistomes, where human faecal input was high at all sewage 
treatment stages (P > 0.05; Fig. 3f). Thus, although sewage treatment 
was successful in reducing the overall carriage of antibiotic resistance 
genes as well as depleting human faecal microbes, antibiotic resistance 
genes of faecal origin survived and could be reintroduced into the natural 
environments where effluent is discharged. 
Figure 4 | Antibiotic resistance proteins found
in multiple habitats and genetic contexts in
RES and PST. a, Representative alignment of 5
of 25 contigs encoding a TEM-type β-lactamase
at 99.9% nucleotide identity (full list of contigs in
Methods). Contigs were annotated with Resfams
v1.2. Source metagenomic libraries are indicated
on the left. b, Antibiotic resistance networks
between human and environmental metagenomes
in RES and PST. Small nodes (squares) represent
unique antibiotic resistance proteins found in
at least one sampled metagenome, coloured by
predicted resistance mechanism. Large nodes
represent individual human/animal (circle) or
environmental (triangle) metagenomes, coloured
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As the presence of antibiotics in sewage systems may influence 
resistome diversity and select for HGT, we used a modified solid phase 
extraction and mass spectrometry protocol to detect 16 antibiotics
from seven classes in 22 WWTP influent and effluent samples. 
Chloramphenicol, ciprofloxacin, tetracycline, trimethoprim, and sul- 
famethoxazole were consistently detected in influent throughout the 
sampling period, while erythromycin was detected in 36% of influent 
samples. Only sulfamethoxazole was detected in effluent samples 
(concentrations of 18-26,000 1g 1~!) (Supplementary Table 12). 
Antibiotics detected in WWTP samples were among the highest selling 
antibiotics in Peru'® and may enrich for antibiotic resistance in these 
bacterial communities and in those exposed to effluent. However, 
no (3-lactams were detected in any WWTP sample, despite the high 
abundance of 8-lactam resistance genes found in our metagenomic 
surveys and amoxicillin being the highest-selling antibiotic in Peru’®. 
This suggests that 6-lactam antibiotics are degraded to undetectable 
levels in humans or the sanitation system before reaching the WWTP. 


Highly cosmopolitan AR and HGT across microbiota 

Although resistome and phylogenetic composition appear tightly linked 
in most microbial communities!””’, some clinically relevant antibiotic 
resistance proteins (for example, TEM, CT X-M, KPC, AAC-6’) 
have been extremely successful in global dissemination via clonal 
expansion and HGT between multiple pathogen hosts**. We identified 
highly cosmopolitan antibiotic resistance proteins by comparing the 
prevalence of genes encoding them across all RES and PST habitats 
(Extended Data Fig. 7a). Two proteins, including the sulfonamide- 
resistant dihydropteroate synthetase (DHPS) Sul2, were found in 50% 
of samples in six of seven habitats. On our functional metagenomic 
contigs, ten of the twelve DHPSs were >98% amino acid identical to 
Sull, Sul2, or Sul3 and were co-localized with integrases and numerous 


by habitat/cohort. Lines connecting samples 
and antibiotic resistance proteins represent 

a ShortBRED hit with an RPKM (reads per 
kilobase per million reads) of > 10, coloured by 
mechanism. 


other antibiotic resistance genes, suggesting multidrug-resistance inte- 
grons may facilitate their broad distribution in these settings*”. 

To further investigate antibiotic resistance exchange potential in 
RES and PST, we examined flanking genetic sequences in the contig 
assemblies from our functional selections for evidence of past HGT. 
A total of 120 (11%) of our unique antibiotic resistance proteins were 
encoded in more than one genetic context (contigs with <90% local 
nucleotide identity; Extended Data Fig. 7b), and the number of contexts 
was positively correlated with the number of metagenomic libraries 
(Spearman’s rho = 0.59, P < 2.2 × 10⁻¹⁶) and habitats (Spearman’s 
rho = 0.47, P < 2.2 × 10⁻¹⁶) in which an antibiotic resistance protein 
was encoded. One TEM β-lactamase (TEM-1) was encoded in 25 contexts 
(Fig. 4a). In contrast, 41% of antibiotic resistance proteins found 
in multiple habitats were always encoded in the same genetic context. 
For instance, a TetX (a tetracycline-inactivating enzyme) variant was 
encoded in the same context in human and animal faeces, latrines, and 
sewage influent. Three of the six antibiotic resistance proteins encoded 
by both human faecal and soil microbiota were encoded in the same 
genetic context (CblA and TEM class A β-lactamases and a class D 
β-lactamase). 

We revealed a large network of antibiotic resistance gene sharing 
between microbial communities of human, animal, and environmental 
origin (Fig. 4b), facilitated by HGT between bacterial genomes and 
spread of bacterial hosts across communities. To further assess the 
potential mobility of the antibiotic resistance genes found in our con- 
tigs, we identified putative mobile genetic elements (MGEs) and multi- 
drug resistance clusters (MDRCs) by annotation (Supplementary Table 
13; see Methods). There was a small but significant positive correlation 
between the proportion of antibiotic resistance contigs with an MGE 
or MDRC and the number of libraries and habitats in which the 
antibiotic resistance protein was encoded (Spearman’s rho = 0.11-0.17, 
P < 4.3 × 10⁻⁴), which supports a role for MGEs and MDRCs in 
antibiotic resistance transfer across environments and increased 
accessibility to pathogens. When ecological analyses were restricted to only 
antibiotic resistance genes adjacent to an MGE on one of our functional 
contigs, the results largely recapitulated the trends observed with the 
full antibiotic resistance set, with resistome correlating with phyloge- 
netic composition across ecological habitats. Additionally, the RES 
human faecal contribution was reduced in soil compared to latrine 
resistomes, while the contribution of PST human faeces to sewage was 
not significantly different before or after treatment (Supplementary 
Discussion and Extended Data Fig. 8). 


Conclusions 

Our characterization of human faecal and environmental microbiota 
and their resistomes from two low-income settings in Latin America 
is particularly relevant to global public health. Billions of people 
currently live in rural or transitional areas around large urban centres, 
where unregulated access to antibiotics and limited access to clean 
water and improved sanitation increase the risk of pathogen transmis- 
sion. Future studies on the factors that promote or restrict antibiotic 
resistance exchange between environmental microbiota, human 
commensals, and pathogens, particularly during waste treatment, 
are merited. These would involve real-time molecular surveillance of 
‘high-risk’ environments (for example, hospitals, large-scale animal 
feeding operations) to identify specific routes for the spread of resist- 
ant bacteria and antibiotic resistance genes and inform the design 
of public health interventions to decrease their global enrichment and 
dissemination. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 


Site and population overview. The community sampled in Lima (PST) is a ‘pueblo 
joven’ (or shantytown) built on the slopes of the desert hills ~15 miles southwest 
of Lima, Peru (Extended Data Fig. 1a). The area was largely settled in the 1980s; 
the 2007 census recorded 56,915 inhabitants in an area of ~5 km², a population 
density approximately four times higher than the rest of the Lima Metropolitan 
Area. Although accurate estimates of socioeconomic status were not available, the 
community is considered low income (although not uniformly) with an average 
family income of ~$130 per month. Although the majority of the households 
have access to electricity, water and sewage, many households in the poorer hilltop 
settlements still lack access to these services. Most households are now linked to a 
district-wide sewage system that funnels waste from PST and neighbouring 
communities to the nearby ‘San Juan’ WWTP, one of 19 treatment plants serving the 
city of Lima. Sewage influent is collected in large aeration lagoons and subjected 
to stages of oxidation, settling, and chlorination in subsequent lagoons. Diarrhoeal 
diseases are common in this community, in particular among children, with an 
average of three episodes of diarrhoea per year due to high prevalence of various 
bacterial, viral and protozoan infectious agents. Commonly used antibiotics 
are often purchased without prescription at the local pharmacies. 

The rural Salvadoran village (RES) is home to approximately 100 people and is 
located in the mountains outside of San Salvador. Most individuals are subsistence 
farmers, although some are employed outside the village. They also participate in 
communal small-scale commercial enterprises, such as fish cultivation. Although 
the community has benefitted from infrastructure improvements, including pre- 
fabricated houses, drinking water is available only from a sand-filtered communal 
tap. Drinking water is stored in containers, and washing of dishes and clothing is 
primarily performed with unfiltered water or rainwater stored in outdoor barrels or 
reservoirs, which can be visibly contaminated with plant material. Each household 
has a double-vault composting latrine, a recommended method of waste disposal 
in low-income areas, but which was available to only ~14% of rural Salvadorans 
in 2013 (ref. 41). Of the almost half (47%) of all people who live in rural areas 
worldwide, 16% do not have drinking water sources protected from contamination 
with human excreta, and 50% lack sanitation facilities that separate excrement from 
human contact. Urine is diverted away from the latrine, and wood ash is added 
to the latrine compartment after each use to increase the internal pH. After the 
compartment is full, it is sealed off to allow heat, desiccation, and alkalinization to 
destroy faecal microbes. Sterilized waste is then spread onto household agricultural 
plots as fertilizer, but sterilization may be incomplete. The village is a two-hour 
walk from the nearest primary health care centre, but many antibiotics are available 
for purchase over-the-counter. Depending on the season (rainy or dry), villagers 
consume a combination of food grown in individual household plots and food 
purchased from the town. They primarily consume beans, as well as starches such 
as tortillas and rice, and rarely consume meat. Chickens and cows are the most 
common domestic animals. 
Study design. In the Salvadoran village (RES), the study was explained to the com- 
munity in a public forum at a preliminary visit, and all members of the community 
were invited to participate. In the Peruvian community (PST), ten households 
with a minimum of four members and one child <10 years old were randomly 
selected and invited to participate in the study. All individuals living in the same 
household were invited, but were not required, to participate. Written informed 
consent from each participant was obtained before asking survey questions or 
faecal sample collection. Both studies conform to the guidelines set forth in the 
Helsinki Declaration. Prior to initiation, the studies were reviewed and received 
approval from the National Ethics Committee of El Salvador (Comité Nacional de 
Ética para la Investigación, Acta number 039-2012), the Institutional Review Board 
of Asociación Benéfica Prisma in Lima (Act CE0809.12), and The Washington 
University in St Louis Institutional Review Board (IRB ID numbers 201301049/ 
201206094). 

After the initial collection in January 2013, human faecal samples from RES were 
collected one week, three weeks, and one year later. Human faecal samples from PST 
were collected between January and May 2012. We sampled 3.6 ± 2.0 individuals 
per house in RES and 5.7 ± 3.0 in PST. Household environmental samples (soil, 
water, latrine compartments, and animal faeces from dogs, cows, chickens, guinea 
pigs) were collected with the permission of the residents. In RES, we sampled soils 
from washing areas adjacent to each house, adjacent to the latrine compartment 
where treated waste is removed, from urine-diverting tube exits, and where possible, 
from agricultural plots and chicken coops, as well as mud from the community 
pond’s edge. We also collected water from the piped source of sand-filtered drinking 
water, stored drinking water, wash water from outdoor storage barrels, reservoirs, 
and other containers, and the community pond and its inflow. 

Sewage influent (pre-treatment) and effluent (post-treatment) samples from the 
wastewater treatment plant ‘PTAR San Juan’, located in the vicinity of PST, were 
collected in collaboration with the Water and Sewage Authority of Lima (SEDAPAL). 
Twelve influent and effluent samples were collected between May 2012 and January 
2013 to assess the stability of sewage communities over time. We collected influent 
and effluent from the districts of San Juan de Miraflores (SJM; which PST is part 
of) and Villa El Salvador (VES; a neighbouring pueblo joven with similar demo- 
graphic characteristics), which converge in this WWTP and serve a population 
of ~700,000. 

RES demographic survey. In RES, a survey was administered to each study par- 
ticipant at the first three sample collections inquiring about household structure, 
occupation, diet, and medications, among other topics. Self-reported answers were 
used to determine participant age, household association, and frequency of travel 
outside the community, as well as latrine and animal ownership for each household. 
DNA extraction. Samples were collected in sterile containers, immediately frozen, 
and stored at −20 to −80 °C until shipment to Washington University in St Louis, 
USA, where samples were stored at −80 °C until DNA extraction. Metagenomic 
DNA was extracted from approximately 400 to 600 mg of each faecal and latrine 
sample with the phenol-chloroform bead-beating protocol described previously. 
Metagenomic DNA was extracted from soil and latrine samples with high ash 
content using MO BIO PowerSoil and PowerMax Soil DNA isolation kits. Water 
samples were filtered with sterile 0.22 µm filters, and metagenomic DNA was extracted 
from the filter membranes using the MO BIO PowerWater DNA isolation kit. For 
PST sewage influent and effluent, 50 ml of sample was centrifuged at 10,000 r.p.m. 
for 10 min; pellets were resuspended in 0.5 ml PBS, transferred to microcentrifuge 
tubes, and metagenomic DNA was isolated using the phenol-chloroform extrac- 
tion protocol used for faecal samples. Although extraction method does have an 
effect in metagenome studies, large differences in community composition (such 
as those found between different microbial habitats) have a greater influence on 
variation between samples. 

Functional metagenomic selection. Small-insert shotgun expression libraries 
were created from metagenomic DNA in the vector pZE21 in E. coli DH10B 
as previously described. 68 libraries were created from 51 human faecal and 
45 soil, latrine, sewage, and animal faecal metagenomes. Libraries were created 
for all human faecal samples (n = 20) and for soil, latrine, and cow faecal samples 
from two houses in RES (n= 14). House 4 was a family of four adults, and House 
6 was a family of two adults and four children where one parent routinely worked 
outside of the community. Libraries were also created from latrine samples from 
four additional houses (n= 4). Six of the human faecal libraries, one soil library, 
and one animal faecal library from RES were each created from two pooled metage- 
nomes from the same individual/location at different time points. Libraries were 
created from human (n= 31) and animal (n=5) faecal samples from four houses in 
PST, as well as pooled sewage treatment plant influent and effluent. Metagenomic 
DNA from thirteen sewage influent and nine effluent samples were combined into 
a separate pool for each stage before library creation and selection because of low 
per-sample DNA yield. 

Libraries were screened as previously described on Mueller-Hinton agar 
containing 50 µg ml⁻¹ kanamycin and another antibiotic at concentrations inhibitory 
to E. coli DH10B harbouring pZE21 without an insert (Supplementary Table 1). 
The surviving colonies for each selection were pooled. Metagenomic inserts from 
each pool were isolated via PCR with vector-specific primers, barcoded, and 
sequenced in parallel with the Illumina HiSeq 2000 (2 x 101 or 2 x 150 bp reads). 
Reads were demultiplexed by barcode, assembled into contigs with PARFuMS, 
and annotated with Resfams v1.2 (ref. 23). 

A selection was excluded from analysis of antibiotic resistance if: (a) more than 
100 contigs were assembled; (b) the number of contigs assembled was more than 
ten times the number of colonies on the selection plate. 

With these criteria, 16 out of 568 selections (2.8%) were excluded. Antibiotic 
resistance genes were identified by Resfam annotation (Supplementary Table 1). 
If a core, hand-curated Resfam annotation specific to the antibiotic class was present 
on a contig, it was preferentially considered the causative resistance gene. Other 
Resfam annotations plausible for that antibiotic class were then identified from 
any contig that did not already contain a resistance gene. With this method, less- 
specific annotations such as efflux pumps were only identified as resistance 
genes if they were not co-localized with an antibiotic class-specific, canonical 
resistance gene. 

Lipopolysaccharide modification is a conserved mechanism of antimicrobial 
peptide resistance in the phylum Bacteroidetes. In the colistin selections, 25 ORFs 
were annotated as ‘PAP2 superfamily’ (PF01569.16) and shared 30.4-38.5% 
global amino acid identity with the Bacteroides thetaiotaomicron resistance gene 
lpxF (AAO76961.1). These open reading frames (ORFs) were also considered 
antibiotic resistance genes. 

To confirm the function of the ciprofloxacin resistance gene from library 
01C_014, the plasmid was isolated from the resistant colony and reintroduced 
into a susceptible strain of E. coli, and the transformed cells were confirmed to 
grow in liquid and solid media containing 0.5 µg ml⁻¹ ciprofloxacin at 48 and 72 h 
after inoculation, which was not observed for the negative control. The insert was 
amplified from the plasmid via PCR and Sanger-sequenced from both ends of the 
pZE21 vector. The combined Sanger sequence was manually trimmed to remove 
vector sequence, and the resulting contig (1,043 bp) was annotated with Resfams 
v1.2. The contig and its single ORF were included with the resistant contigs and 
ORFs identified by annotation above. 

A total of 2,075 antibiotic resistance ORFs were identified on 1,955 contigs. The 
ORFs and the MetaGeneMark-generated protein sequences for each resistance 
ORF were each clustered at 100% identity over the entire length of the shorter 
sequence to identify identical sequences, collapsing to 1,245 unique (100% nucle- 
otide identical) ORFs and 1,100 unique (100% amino acid identical) proteins. The 
contigs were clustered at 90% local identity (cd-hit-est parameters: -c 0.9 -d 0 
-r 1 -G 0 -n 8 -uS 0.05 -aS 0.5) to identify different genetic surroundings. 
Identification of top hits in NCBI nr. The MetaGeneMark-generated protein 
sequence for each ORF was compared to NCBI nr (accessed on 15 September 2014) 
with blastp to identify the top local hit(s). A Needleman-Wunsch alignment was 
generated with EMBOSS needle for each protein and top hit (default parameters), 
and the global percent identity was calculated as the number of identities over the 
length of the shorter sequence. 
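
The identity calculation described above can be illustrated with a minimal Python sketch. It assumes the two sequences have already been globally aligned (for example by EMBOSS needle) and are supplied as equal-length strings with '-' marking gaps; the function name and the example sequences are hypothetical and are not taken from the study's code.

```python
def global_percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity of a global alignment, normalised by the shorter sequence.

    Identities are aligned columns where both sequences carry the same residue;
    the denominator is the ungapped length of the shorter input sequence.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    identities = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    # Ungapped lengths of the two original sequences.
    len_a = len(aligned_a.replace(gap, ""))
    len_b = len(aligned_b.replace(gap, ""))
    shorter = min(len_a, len_b)
    if shorter == 0:
        raise ValueError("sequences must not be empty")
    return 100.0 * identities / shorter


if __name__ == "__main__":
    # Hypothetical aligned protein fragments, for illustration only.
    print(round(global_percent_identity("MKT-LLV", "MKTALL-"), 1))
```

Normalising by the shorter sequence means a short protein fully contained in a longer top hit can still score 100%.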

Whole metagenome shotgun sequencing. Metagenomic DNA was sheared 
to 300-400 bp, barcoded by sample, and sequenced on an Illumina HiSeq or 
NextSeq with 2 x 150 bp paired reads. Reads were demultiplexed by barcode with 
no mismatches, retaining reads whose pair did not contain the same barcode 
as unpaired reads. Demultiplexed reads were trimmed with Trimmomatic-0.30 
to remove Illumina adaptor and low-quality bases (<Q13) from the ends, with 
default ILLUMINACLIP parameters and a minimum trimmed read length of 
36 bases. Paired reads were trimmed in palindrome mode, while single reads 
were trimmed in simple mode. Human sequences were removed with DeconSeq 
by mapping to the human reference genome (GRCh38). Any paired read whose 
pair was a human sequence was also removed. Samples with fewer than 1 million 
total reads (paired and unpaired) were excluded from further analysis. 
85 of the 98 samples interrogated with functional metagenomics were shotgun 
sequenced. 

Assembly of metagenomes from low-diversity metagenomes. Nine shotgun 
metagenomes from children <3 years old with 150 OTUs/sample or fewer based 
on 16S data (see below) were assembled using Velvet (Supplementary Table 3). 
VelvetOptimiser was run on each sample with hash values from 19 to 141 in steps 
of 2, with both paired and unpaired reads, using n50 as the optimization function. 
Assembled contigs were annotated with Resfams, and ORFs with core Resfams 
annotations were included in the ShortBRED markers. 

Quantification of antibiotic resistance genes in metagenomes with ShortBRED. 
ShortBRED was used to quantify the abundance of antibiotic resistance genes in 
the metagenomes. ShortBRED identifies unique marker sequences for clustered 
proteins that distinguish them from close homologues and maps reads to only 
those markers. This technique has greater accuracy than mapping to the entire 
protein, especially for antibiotic resistance genes, many of which evolved from 
genes performing non-resistance functions in the host. 

ShortBRED markers were identified from the antibiotic resistance proteins 
(1) isolated from the functional selections performed in this study (2,075), (2) 
identified from the human faecal metagenome assemblies in this study (132), (3) 
from the Comprehensive Antibiotic Resistance Database (CARD) (downloaded 20 
October 2014; 2,972 proteins), and (4) from the Lahey β-lactamase database 
(http://www.lahey.org/studies/; 1,145 proteins; one short protein, VEB-6, removed) 
(Supplementary Table 4), clustered at 100% identity. The reference database was 
the modified version of the Integrated Microbial Genomes database, version 3.5, 
described in ref. 24. ShortBRED produced 2,275 markers when clustered at 100% 
identity (Supplementary Table 5) and 1,266 markers when clustered at 90% identity. 
Unless noted, the 100% identity markers were used for all analyses. 

We quantified translated antibiotic resistance gene abundance in all 
metagenomes by mapping paired and unpaired fastq reads to the ShortBRED markers with 
99% sequence identity. This extended our resistome investigation to individuals 
and sites not interrogated using functional metagenomics and to antibiotics that 
target Gram-positive bacteria (for example, vancomycin, macrolides) and are not 
detectable in functional selections in our Gram-negative E. coli host. All 
analyses were performed on marker abundances normalized to reads per kilobase per 
million reads (RPKM). 
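
As an illustration of the RPKM unit, the sketch below implements the generic definition (reads mapped, divided by marker length in kilobases and by library size in millions of reads). It is not ShortBRED's internal normalisation, which is not described here; the function name and the example numbers are assumptions.

```python
def rpkm(mapped_reads: int, marker_length_bp: int, total_reads: int) -> float:
    """Reads per kilobase of marker per million reads in the library.

    Generic definition only: mapped / (length_bp / 1e3) / (total_reads / 1e6),
    rearranged so the arithmetic stays exact for integer inputs.
    """
    if marker_length_bp <= 0 or total_reads <= 0:
        raise ValueError("marker length and total reads must be positive")
    return mapped_reads * 1_000_000_000 / (marker_length_bp * total_reads)


if __name__ == "__main__":
    # Hypothetical values: 30 reads hit a 300 bp marker in a 2-million-read library.
    print(rpkm(30, 300, 2_000_000))
```

Dividing by both marker length and library size makes abundances comparable across markers of different lengths and samples of different sequencing depth.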

For antibiotic resistance proteins from the CARD and Lahey databases, metadata 
(resistance category, mechanism of action, antibiotic target(s)) was hand- 
curated from information available on the CARD website. For antibiotic resist- 
ance proteins identified through functional metagenomic selections and shotgun 
assemblies, resistance category and mechanism of action were assigned based on 
Resfams annotation. Antibiotic target(s) for the former were the antibiotics to 
which they conferred resistance in our functional selections, while the latter were 
not assigned an antibiotic target. Annotations for ShortBRED markers were drawn 
from all constituent proteins. 

Comparison of human faecal resistomes to published cohorts. For comparison 
to the resistomes in ref. 25, whole metagenome shotgun reads were downloaded 
from SRA (accession PRJNA268964). Fastq reads were trimmed with Trimmomatic 
in simple mode using the same parameters as for the RES and PST reads. Samples 
with fewer than 1 million total reads and individuals <3 years old were excluded. 
ShortBRED markers were quantified as above. For ref. 25, the average read length 
of the paired reads for each sample was specified as the average read length during 
ShortBRED quantification. Only ShortBRED markers that included proteins from 
the CARD and Lahey databases were considered for this analysis to avoid bias 
towards our cohorts. 

16S gene V4 amplification, sequencing, and preprocessing. The 16S gene V4 
region (515-806) was amplified using the original Earth Microbiome Project 
(EMP) protocol (http://www.earthmicrobiome.org/emp-standard-protocols/16s/) 
or with Takara Taq DNA polymerase premix, with barcoded primers designed 
in ref. 51, from 228 human faecal and 203 environmental samples. 245 samples 
interrogated by whole metagenome shotgun sequencing were also 16S sequenced. 
Barcoded amplicons were pooled and sequenced on an Illumina MiSeq with 
2 x 250 bp paired-end reads. 

Barcoded Illumina reads were demultiplexed with QIIME version 1.8.0, 
split_libraries_fastq.py. Paired reads were truncated at the first base with quality 
score < Q3 and merged using usearch, requiring 100% identity in the overlap 
region and a merged length of 253 bp ± 5 bp. Merged reads were filtered 
with QIIME to remove reads with three or more contiguous bases with quality 
score < Q20. 
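
The read-level rules in this paragraph can be sketched in plain Python. This is an illustration of the stated thresholds only, not the QIIME or usearch implementation; the function names are hypothetical and quality scores are assumed to be already decoded to integers.

```python
from typing import List, Tuple


def truncate_at_low_quality(seq: str, quals: List[int], min_q: int = 3) -> Tuple[str, List[int]]:
    """Cut the read at the first base whose quality score is below min_q."""
    for i, q in enumerate(quals):
        if q < min_q:
            return seq[:i], quals[:i]
    return seq, quals


def has_low_quality_run(quals: List[int], threshold: int = 20, run_length: int = 3) -> bool:
    """True if run_length or more contiguous bases score below threshold."""
    run = 0
    for q in quals:
        run = run + 1 if q < threshold else 0
        if run >= run_length:
            return True
    return False


def merged_length_ok(length: int, target: int = 253, tolerance: int = 5) -> bool:
    """True if a merged read length falls within target +/- tolerance."""
    return abs(length - target) <= tolerance


if __name__ == "__main__":
    print(truncate_at_low_quality("ACGTAC", [30, 30, 2, 30, 30, 30]))
    print(has_low_quality_run([30, 19, 18, 17, 30]))
    print(merged_length_ok(250))
```

A merged read would be kept under these rules only if its length passes `merged_length_ok` and `has_low_quality_run` returns False.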

Open OTU picking with UPARSE for RES and PST comparisons. OTUs were 
picked from all 12,797,788 merged and filtered RES and PST reads with the 
UPARSE pipeline. Singletons were excluded from OTU picking, and a reference-based 
chimaera check against the GOLD database (downloaded 1 September 2013) 
was performed on OTUs as recommended. Reads were assigned to OTUs at 97% 
identity using usearch. Representative sequences from each OTU were assigned 
taxonomy with uclust against the Greengenes database (release 13_8, 97% clusters), 
aligned, and used to create a phylogenetic tree using QIIME. 19,301 OTUs were 
picked across all samples. Biom tables were rarefied to 7,000 sequences per sample, 
which excluded eight samples. 
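
Rarefaction to a fixed depth can be sketched as random subsampling without replacement, with samples below the target depth dropped. The study used QIIME for this step; the code below is a simplified stand-in, and the function name, seed handling, and example counts are assumptions.

```python
import random
from collections import Counter
from typing import Dict, Optional


def rarefy(otu_counts: Dict[str, int], depth: int, seed: int = 0) -> Optional[Dict[str, int]]:
    """Subsample one sample's OTU counts to exactly `depth` sequences.

    Returns None when the sample has fewer than `depth` sequences, which
    corresponds to excluding that sample from the rarefied table.
    """
    total = sum(otu_counts.values())
    if total < depth:
        return None
    # Expand counts into one entry per sequence, then draw without replacement.
    pool = [otu for otu, n in otu_counts.items() for _ in range(n)]
    rng = random.Random(seed)
    return dict(Counter(rng.sample(pool, depth)))


if __name__ == "__main__":
    sample = {"OTU_1": 6000, "OTU_2": 3000, "OTU_3": 1000}  # hypothetical counts
    rarefied = rarefy(sample, 7000)
    print(sum(rarefied.values()))
```

Expanding counts into a list is memory-hungry for deep samples; it is used here only to keep the sketch short.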

Comparison of phylogenetic composition of human faecal samples to pub- 
lished cohorts. Raw 16S amplicons from ref. 19 were downloaded from MG-RAST 
(accession number qiime:850). 16S amplicons from faecal samples from ref. 14 
were downloaded from the European Nucleotide Archive (ENA) (ERP008799). 
16S amplicons from ref. 25 were provided by the authors. For all studies, reads 
were generated with primers F515/R806 on an Illumina platform. We classified 
each cohort’s lifestyle as described in ref. 25. Only samples from individuals 
3 years or older were included in analysis. Individuals labelled with an adult- 
specific keyword (Family Member: “Mother”, “Father”, or Sample Identifier: “adlt”) 
in ref. 19 supplementary table 2 were also included. Samples from ref. 19 were 
rarefied to 50,000 reads each to reduce computational load. The merged reads from 
RES and PST human samples and the reads from ref. 14 were truncated at 101 bp. 

OTUs were picked from all reads with the reference-based protocol described 
in ref. 19 against the Greengenes database (13_8, 97% clusters). Reverse strand 
alignment was permitted, and new clusters were suppressed. The OTU table was 
rarefied to 5,000 sequences per sample, which excluded two samples from ref. 14. 
Sample filters. To better characterize within- and between-sample diversity, 
we limited both the shotgun and 16S analysis to a single sample per human 
individual except where noted. Infant faecal microbiota undergo large and frequent 
shifts in microbial composition before stabilizing into an adult-like configuration 
around three years of age. Therefore, except where noted, we limited all analyses 
with human samples to those from individuals 3 years old or older. 

Alpha diversity. For 16S, biom tables were rarefied ten times to 7,000 sequences 
per sample, and equitability, observed species, and Faith’s phylogenetic distance 
were calculated on all rarefactions and averaged using the QIIME script alpha_ 
diversity.py. For resistome, tables were not rarefied, and only the number of ARGs 
present in each sample was compared. Alpha diversity was compared between 
categories using non-parametric Student's t-tests with 999 permutations at a depth 
of 7,000 sequences per sample with Bonferroni correction for multiple hypothesis 
testing using the QIIME script compare_alpha_diversity.py. For Extended Data 
Fig. 4g, the total antibiotic resistance RPKM for each sample was summed, multi- 
plied by one million, and divided by the total number of reads for that sample to 
confirm that the results were robust to sequencing depth. 

Beta diversity. Unweighted UniFrac, weighted UniFrac, Sorensen-Dice, and 
Bray-Curtis dissimilarity matrices were calculated from biom tables with the 
QIIME script beta_diversity.py, using the Greengenes 13_8 97% phylogenetic tree 
(for closed reference OTU picking) and the phylogenetic tree generated during 
de novo OTU picking for UniFrac distances. Principal coordinates analysis was 
performed with the QIIME script principal_coordinates.py. 

ANOSIM and adonis analyses were performed on dissimilarity matrices with 
the QIIME script compare_categories.py or in R (vegan package functions anosim() 
and adonis()) with 999 permutations. 

Distance to centroid was calculated in R with the vegan package function 
betadisper(), and significance was tested with a permutation test (permutest()). 

Average pairwise dissimilarities between categories were compared with 
non-parametric Student's t-tests with 999 Monte Carlo permutations with 
Bonferroni correction using code adapted from the QIIME script 
make_distance_boxplots.py. For shared antibiotic resistance proteins, the number of 
antibiotic resistance proteins shared between each pair of samples was calculated 
with shared_phylotypes.py, and significance was calculated as above. 

Procrustes. The 16S OTU table rarefied to 7,000 sequences/sample and the 
antibiotic resistance abundance table were both filtered to include only samples 
interrogated with both methods (n= 172). Bray-Curtis dissimilarity matrices 
were calculated for both, principal coordinates analysis was performed, and 
procrustes analysis was performed on the PCoA results (QIIME script 
transform_coordinate_matrices.py) across 172 dimensions with 999 permutations to 
determine significance. 

Identification of discriminatory taxa using LEfSe. Taxa summaries were created 
from the rarefied OTU table and filtered to levels L1 to L5 (kingdom to family). 
LEfSe was used to identify taxa that were discriminative between categories. 
Alpha was 0.05 for both the Kruskal-Wallis and pairwise Wilcoxon rank-sum 
tests. The Linear Discriminant Analysis (LDA) effect size threshold was set at 
3.0 or 4.0, and all-against-all comparison was performed. 

Identification of discriminatory antibiotic resistance functions. The antibi- 
otic resistance table was filtered to only antibiotic resistance proteins detected in 
the sample set under consideration and summarized by each metadata category 
with the QIIME script summarize_taxa.py. Absolute abundances rather than relative 
abundances were calculated, because antibiotic resistance genes represent 
a small fraction of the total metagenome. Supervised learning with the Random 
Forests classifier was performed to identify antibiotic resistance categories that 
most discriminated between sample categories (supervised_learning.py, 500 trees, 
and tenfold cross-validation). The top discriminatory antibiotic resistance cate- 
gories (as determined by feature importance scores) enriched in subsets of the 
samples were plotted onto the principal coordinates of all samples, using the abun- 
dance-weighted average for the coordinates. The size of the point is proportional 
to the overall abundance across all samples under consideration (1/100 RPKM). 
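
The abundance-weighted average used to place each resistance category on the ordination can be sketched as follows. This is a minimal illustration under the assumption that each sample has PCoA coordinates and an RPKM abundance for the category; the function name and example values are hypothetical.

```python
from typing import Sequence, Tuple


def weighted_centroid(
    coords: Sequence[Sequence[float]], abundances: Sequence[float]
) -> Tuple[float, ...]:
    """Abundance-weighted mean of sample coordinates, one value per axis."""
    if len(coords) != len(abundances) or not coords:
        raise ValueError("need one abundance per sample coordinate")
    total = sum(abundances)
    if total <= 0:
        raise ValueError("total abundance must be positive")
    n_axes = len(coords[0])
    return tuple(
        sum(a * c[k] for c, a in zip(coords, abundances)) / total
        for k in range(n_axes)
    )


if __name__ == "__main__":
    # Two hypothetical samples on PC1/PC2 with RPKM abundances 1 and 3.
    print(weighted_centroid([(0.0, 0.0), (1.0, 2.0)], [1.0, 3.0]))
```

A category is therefore drawn closest to the samples in which it is most abundant.
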
SourceTracker. We estimated the proportion of each latrine, soil, and water 
microbiota attributable to RES human faeces and the proportion of each sewage 
microbiota attributable to PST human faeces using SourceTracker. Antibiotic resistance 
tables were filtered to remove any samples and markers without any observations. 
SourceTracker was run through QIIME with default settings using human faecal 
microbiota as source. Jitter was introduced in boxplots to allow visualization, but 
some samples with very low abundance are not distinguishable. 

Mass spectrometry-based detection of antibiotics in WWTP samples. Solid 
phase extraction and ultra-performance liquid chromatography-tandem mass 
spectrometry (UPLC-MS/MS) were used to detect the presence of 16 antibiotics 
from 7 classes in 11 influent and 11 effluent sample supernatants using a modified 
protocol”. Fifty ml of influent or effluent sample were centrifuged at 10,000 r.p.m. 
for 10 min; supernatants were concentrated using solid phase extraction cartridges 
(6cc, 200 mg, Waters, Milford, MA). Samples were analysed on a 4000 QTRAP 
LC/MS/MS instrument (AB Sciex). The data were normalized based on the internal
standard (isotopically labelled caffeine, 13C3, Cambridge Isotope Laboratories) to 
account for experimental variation and antibiotic extraction/ionization efficiency. 
A mixture of the antibiotic standards was also processed along with the samples as 
positive controls to test for recovery. Standards for amoxicillin, cefotaxime, cefox- 
itin, chloramphenicol, ciprofloxacin, clindamycin, erythromycin, penicillin G, 
sulfamethoxazole, tetracycline, and trimethoprim were purchased from Sigma- 
Aldrich (St Louis, MO). Standards for azithromycin, aztreonam, cefepime, doxy- 
cycline, and tigecycline were purchased from AK Scientific (Union City, CA). All 
analyses were performed at the Proteomics and Mass Spectrometry Facility at the 
Donald Danforth Plant Science Center (St Louis, MO). 
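
The text does not give the normalization formula. A common approach, assumed here purely for illustration, is to divide each antibiotic's peak area by the peak area of the internal standard measured in the same sample. The function name and the peak areas below are hypothetical.

```python
def normalize_to_internal_standard(analyte_area, standard_area):
    """Return the analyte peak area relative to the internal standard.

    Assumes a simple ratio; the internal standard (labelled caffeine in the
    text) is expected to share extraction and ionization losses.
    """
    if standard_area <= 0:
        raise ValueError("internal standard was not detected")
    return analyte_area / standard_area


# Hypothetical peak areas: (antibiotic, internal standard) per sample.
peak_areas = {
    "influent": (1200.0, 400.0),
    "effluent": (300.0, 600.0),
}

normalized = {
    sample: normalize_to_internal_standard(analyte, standard)
    for sample, (analyte, standard) in peak_areas.items()
}
```

Because the standard is spiked at the same amount in every sample, the ratio cancels sample-to-sample differences in recovery and instrument response.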

Network creation. Antibiotic resistance gene networks were generated based on 
ShortBRED results of human and environmental metagenomes, filtering hits with
an RPKM value of <0.1 to determine sharing across samples. Graphics were generated
in Cytoscape 3.2.1 using an edge-weighted spring-embedded layout. 
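
A minimal sketch of the filtering and sharing step follows. It assumes samples are nodes and that an edge is weighted by the number of resistance genes two samples share; the text does not specify the graph structure, so that choice, the function name and the example values are hypothetical. Only the 0.1 RPKM cutoff comes from the text.

```python
from itertools import combinations

RPKM_CUTOFF = 0.1  # hits with RPKM below this value are discarded


def shared_gene_edges(rpkm_by_sample, cutoff=RPKM_CUTOFF):
    """Return {(sample_a, sample_b): number of shared genes} after filtering."""
    detected = {
        sample: {gene for gene, rpkm in genes.items() if rpkm >= cutoff}
        for sample, genes in rpkm_by_sample.items()
    }
    edges = {}
    for a, b in combinations(sorted(detected), 2):
        shared = detected[a] & detected[b]
        if shared:
            edges[(a, b)] = len(shared)
    return edges


# Hypothetical ShortBRED-style output: sample -> {gene: RPKM}.
example = {
    "human_1": {"tetW": 5.0, "blaTEM": 0.05},
    "latrine_1": {"tetW": 2.0, "blaTEM": 1.2},
    "soil_1": {"blaTEM": 0.3},
}
edges = shared_gene_edges(example)
```

The resulting edge table could then be imported into a graph tool such as Cytoscape for layout.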

Mobile genetic elements (MGEs). Putative MGEs were identified from the func- 
tional selection contigs by Pfam and TIGRfam annotation (Supplementary Table 13). 
Antibiotic-resistant ORFs were considered co-localized with an MGE if they shared 
a contig with an MGE ORF. 365 (19%) antibiotic resistance contigs contained at least
one of 236 unique MGEs (100% amino acid identical). 
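
The co-localization rule can be sketched as a simple grouping of ORFs by contig. The ORF records and labels below are hypothetical; the final line only checks the reported percentage against the 1,955 antibiotic resistance contigs given in the Extended Data Figure 7 caption.

```python
def colocalized_contigs(orfs):
    """Return contigs carrying both an antibiotic resistance ORF and an MGE ORF.

    orfs: iterable of (contig_id, orf_type), with orf_type 'AR', 'MGE' or other.
    """
    types_by_contig = {}
    for contig_id, orf_type in orfs:
        types_by_contig.setdefault(contig_id, set()).add(orf_type)
    return sorted(
        contig for contig, kinds in types_by_contig.items()
        if {"AR", "MGE"} <= kinds
    )


# Hypothetical annotations for three contigs.
annotated_orfs = [
    ("contig_1", "AR"), ("contig_1", "MGE"),
    ("contig_2", "AR"), ("contig_2", "other"),
    ("contig_3", "MGE"),
]
colocalized = colocalized_contigs(annotated_orfs)

# 365 of 1,955 antibiotic resistance contigs, as a rounded percentage.
reported_percent = round(100 * 365 / 1955)
```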

Multidrug resistance clusters. Multidrug resistance clusters were identified as 
contigs containing multiple antibiotic resistance proteins identified from our selec- 
tions and/or annotated with a core Resfams profile HMM (ref. 23), regardless 
of the antibiotic used in the selection. 200 (10%) contigs contained multidrug 
resistance clusters. 

Mobilome analysis. For ‘mobilome’ analyses, the ShortBRED markers were 
restricted to those that included an antibiotic resistance protein encoded in our 
functional metagenomic selection contigs adjacent to a putative MGE. 
Phylogenetic classification of contigs with PhyloPythia. Antibiotic-resistant 
contigs and metagenomic assembly contigs were classified by PhyloPythia***” 
using the recommended model (800 genera, 2013). 

Data reporting. No statistical methods were used to predetermine sample size. The 
experiments were not randomized. The investigators were not blinded to allocation 
during experiments and outcome assessment. 
a
Peru: Shanty town (PST)
- Peri-urban slum 15 km south of Lima
- Mostly low-income families
- ~15,000 inhabitants / km²
- Access to centralized sewage system

Rural El Salvador (RES)
- Agricultural village 35 km south of San Salvador
- Subsistence farmers
- ~80 inhabitants / km²
- Use of composting latrines for excreta disposal

b
Habitat                  Samples (16S seq.)   OTUs     Samples (shotgun seq.)   AR genes
Human                    226                  1641     191                      584
Post-treatment sewage    13                   2220     7                        82
Pre-treatment sewage     30                   2324     27                       514
Animal                   14                   2409     10                       283
Water                    22                   3025     4                        8
Latrine                  36                   3245     16                       242
Soil                     84                   12503    30                       206
Total                    425                  19301    285                      797
Extended Data Figure 1 | Overview of study and methods. a, Location
and overview of study sites in El Salvador and Peru. RES photographs by
the authors G.S.S. and M.M.B., PST photographs by the author P.T.
b, Antibiotic resistance markers and OTUs detected vs number of samples
interrogated by whole metagenome and 16S sequencing by habitat in RES
and PST. c, Proportion of metagenomic libraries (n = 67), all antibiotic
resistance proteins identified from functional metagenomic selections
(n = 1,100), and novel antibiotic resistance proteins identified from
functional metagenomic selections (n = 121) originating from each
microbial habitat. The percent of total libraries/proteins in that category
originating from each microbial habitat is listed above the bar. For all
antibiotic resistance proteins and novel antibiotic resistance proteins, the
total sums to >100% due to proteins identified in more than one habitat.
The number of novel antibiotic resistance proteins vs libraries screened
was significantly different than expected compared to the total for human
and latrines (chi-squared test, P < 0.005).

Extended Data Figure 2 | Human faecal and environmental microbiota
from RES and PST. Microbiota are coloured by habitat. a, PCoA of
Bray-Curtis distances between resistomes. (n = 86, n = 10, n = 16,
n = 30, n = 4, n = 27 and n = 7 for human, animal, latrine, soil, water,
pre-treatment sewage and post-treatment sewage, respectively) Adonis
R² = 22.4%, P < 0.001. b, PCoA of weighted UniFrac distances between
microbiota. (n = 105, n = 14, n = 36, n = 84, n = 22, n = 30 and n = 13
for human, animal, latrine, soil, water, pre-treatment sewage and
post-treatment sewage, respectively.) Adonis R² = 41.9%, P < 0.001.
c, Procrustes transformation of taxonomic composition vs resistome.
Only samples interrogated with both methods were included (n = 172).
M² = 0.360, P < 0.001 (172 dimensions, 999 permutations).
PST (peri-urban industrializing)
RES (rural agriculturalist)
Malawi (rural agriculturalist)
USA * (urban industrial)
Venezuela * (rural agriculturalist)
Matses ** (hunter-gatherer)
Norman, USA ** (urban industrial)
Tunapuco * (rural agriculturalist)
Yanomami *** (hunter-gatherer)

DISCRIMINATIVE TAXA:
A. Actinobacteria
B. Actinobacteria | Bifidobacteriales
C. Actinobacteria | Bifidobacteriales | Bifidobacteriaceae
D. Clostridia
E. Clostridia | Clostridiales
F. Clostridia | Clostridiales | Lachnospiraceae
G. Erysipelotrichi
H. Erysipelotrichi | Erysipelotrichales
I. Erysipelotrichi | Erysipelotrichales | Erysipelotrichaceae
J. Bacilli | Lactobacillales | Lactobacillaceae
K. Bacteroidia
L. Bacteroidia | Bacteroidales
M. Bacteroidia | Bacteroidales | Prevotellaceae
N. Bacteroidia | Bacteroidales | Bacteroidaceae
O. Opitutae | [Cerasicoccales]
P. Opitutae | [Cerasicoccales] | [Cerasicoccaceae]
Q. Gammaproteobacteria
R. Gammaproteobacteria | Enterobacteriales
S. Gammaproteobacteria | Enterobacteriales | Enterobacteriaceae

LIFESTYLE:
Urban industrial
Rural agriculturalist

Actinobacteria
Actinobacteria | Bifidobacteriales
Actinobacteria | Bifidobacteriales | Bifidobacteriaceae
Elusimicrobia | Elusimicrobia
Elusimicrobia | Elusimicrobia | Elusimicrobiales
Elusimicrobia | Elusimicrobia | Elusimicrobiales |
Extended Data Figure 3 | Phylogenetic composition of RES and PST
human faecal microbiota and published microbiota from previous
studies (refs 14, 19, 35). a, b, e, f, RES vs PST. (RES n = 60, PST n = 45) c, d, g, h,
RES and PST vs published human microbiota. (RES n = 60, PST n = 46,
other n = 446; see Supplementary Table 14) a, PCoA of weighted UniFrac
distances between RES and PST human faecal microbiota, coloured by
cohort. Adonis R² = 29.7%, P < 0.001. b, Taxa discriminating between
RES and PST human faecal microbiota as determined by LEfSe. The 
phylogenetic tree includes all kingdom- to family-level taxa present in 
any sample. Coloured taxa are discriminative between cohorts and have 
an LDA effect size of > 4.0; they are coloured by the cohort in which they 
have the highest abundance. Circle size is relative to the highest abundance 
in either cohort. c, PCoA of weighted UniFrac distances between RES and 
PST human faecal microbiota and published human faecal microbiota, 
coloured by cohort. Cohorts are labelled by lifestyle and study (* (ref. 19),
** (ref. 35), *** (ref. 14)). Adonis R² = 37.6%, P < 0.001. d, Taxa
discriminating between host lifestyles for RES and PST and published 
human faecal microbiota as determined by LEfSe, effect size threshold 
3.0. Discriminative taxa are coloured by the host lifestyle in which they 
are most abundant. e, f, Relative abundances of microbial phyla (e) and 
families (f) in human faecal microbiota from RES and PST. *P < 0.05, 
Wilcox test with Bonferroni correction. g, h, Relative abundances of 
microbial phyla (g) and families (h) in human faecal microbiota from RES 
and PST and published human faecal microbiota, by lifestyle. *P < 0.05, 
Kruskal-Wallis test with Bonferroni correction. e-h, Only taxa with a 
mean relative abundance of > 1% in one cohort/lifestyle are shown. Taxa 
are in order of increasing overall mean relative abundance. Error bars, s.d.; 
centre bars, median. 
Extended Data Figure 4 | RES and PST human faecal resistomes and
comparison to the published data sets from ref. 25. a–e, RES and PST
resistomes, coloured by cohort. (RES n = 42, PST n = 44) f, g, RES and
PST vs published human data sets, coloured by cohort. (RES n = 42,
PST n = 44, other n = 53; see Supplementary Table 15) a–c, Absolute
abundances of antibiotic resistance categories (a), antibiotic targets
(b), and mechanisms of action (c) in human faecal resistomes from
RES and PST. Only categories with a mean RPKM of >10 in one cohort
are shown. Categories are in increasing order of overall mean absolute
abundance. Abundances are plotted in log10 scale. *P < 0.05, Wilcox test
with Bonferroni correction. d, Number of antibiotic resistance proteins per
RES and PST human faecal resistome. *P < 0.05, non-parametric Student's
t-tests. e, PCoA of Bray-Curtis distances between RES and PST resistomes,
with abundance-weighted coordinates of the top five most discriminative
antibiotic resistance categories enriched in each cohort (squares, size
proportional to overall abundance). Adonis R² = 25.0%, P < 0.001. f, PCoA
of Bray-Curtis distances between human faecal resistomes from RES and
PST and ref. 25. Adonis R² = 19.7%, P < 0.001. g, Total reads mapping to
antibiotic resistance markers per person (normalized by marker length) 
normalized by the total reads in that sample in RES and PST and published 
human faecal microbiota, by cohort. Includes both paired and unpaired 
reads. The overall distribution of normalized antibiotic resistance 

read depth was significantly different than expected (Kruskal-Wallis, 
P<1x107!). n.s., not significant. All other comparisons are P < 0.05, 
Wilcox test with Bonferroni correction. a–d, f, Error bars, s.d.; centre bars,
median. 
Extended Data Figure 5 | RES human faecal and environmental
microbiota and resistomes. a, b, Relative abundances of microbial phyla
(a) and families (b) in RES microbiota, by habitat. (n = 60, n = 6, n = 36,
n = 84 and n = 22 for human, animal, latrine, soil, water, respectively)
Only taxa with a mean relative abundance of > 1% in one habitat are
shown. Taxa are in increasing order of overall mean relative abundance.
*P < 0.05, Kruskal-Wallis test with Bonferroni correction. c, d, Absolute
abundances of antibiotic resistance categories (c) and antibiotic targets
(d) in RES resistomes, by habitat. (n = 42, n = 4, n = 16, n = 30 and n = 4
for human, animal, latrine, soil, water, respectively). Only categories with a
mean RPKM of >10 in one habitat are shown. Categories are in increasing
order of overall mean absolute abundance. Abundances are plotted in log10
scale. *P < 0.05, Kruskal-Wallis test with Bonferroni correction.
a–d, Error bars, s.d.; centre bars, median.
SEWAGE STAGE: Human gut; Street-access sewage; Sewage plant influent; Sewage plant effluent

Extended Data Figure 6 | PST human faecal and environmental
microbiota and resistomes. a, b, Relative abundances of microbial phyla
(a) and families (b) in human faecal and sewage microbiota from PST, by
stage. (n = 45, n = 16, n = 14 and n = 13 for human, street-access, influent
and effluent, respectively) Only taxa with a mean relative abundance
of > 1% in one stage are shown. Taxa are in increasing order of overall
mean relative abundance. *P < 0.05, Kruskal-Wallis test with Bonferroni
correction. c, d, Absolute abundances of antibiotic resistance categories
(c) and antibiotic targets (d) in PST resistomes, by stage. (n = 44,
n = 14, n = 13 and n = 7 for human, street-access, influent and effluent,
respectively). Only categories with a mean RPKM of >10 in one stage
are shown. Categories are in increasing order of overall mean absolute
abundance. Abundances are plotted in log10 scale. *P < 0.05, Kruskal-Wallis
test with Bonferroni correction. a–d, Error bars, s.d.; centre bars,
median.
Extended Data Figure 7 | Antibiotic resistance gene sharing across
habitats. a, Highly cosmopolitan antibiotic resistance proteins. The 
prevalence of each antibiotic resistance protein in metagenomes from 
each microbial habitat is depicted for all proteins detected in six of the 
seven habitats (n = 21). Detection was based on ShortBRED quantification 
of the protein in each metagenome. Prevalences for an antibiotic 
resistance protein are linked by lines of the same colour. The shape of 
each point reflects the number of habitats in which it was found, as well 
as the minimum prevalence within each habitat. The legend lists the 
annotation for each protein. b, Protein sequences of antibiotic resistance 
genes isolated from functional metagenomic selections were clustered
at 100% amino acid identity, and the number of metagenomic libraries,
microbial habitats (for example, human faecal, soil), and cohorts in which
each unique protein (n = 1,100) was encoded were calculated across all
members of the cluster. Antibiotic resistance contigs (n = 1,955) were
clustered at 90% local identity to identify different genetic contexts,
and the number of genetic contexts in which each unique protein was
encoded was calculated across all contigs encoding a protein in that
cluster. Spearman's rho = 0.59, P < 2.2 × 10⁻¹⁶, number of genetic contexts
vs libraries; rho = 0.47, P < 2.2 × 10⁻¹⁶, number of genetic contexts vs
habitats; Wilcox test, P < 2.2 × 10⁻¹⁶, number of genetic contexts vs
cohorts (one or both).
Extended Data Figure 8 | Mobilome analyses. a, PCoA of Bray-Curtis
distances between RES and PST human and environmental resistomes,
coloured by habitat. (n = 86, n = 10, n = 16, n = 30, n = 4, n = 27 and
n = 7 for human, animal, latrine, soil, water, pre-treatment sewage and
post-treatment sewage, respectively) Adonis R² = 24.1%, P < 0.001.
b, Procrustes transformation of taxonomic composition vs resistome.
Only samples interrogated with both methods were included (n = 172).
M² = 0.493, P < 0.001 (172 dimensions, 999 permutations). c, PCoA of
Bray-Curtis distances between RES (n = 42) and PST (n = 44) resistomes,
coloured by cohort. Adonis R² = 31.0%, P < 0.001. d-f, RES human faecal
and environmental microbiota and resistomes, coloured by habitat.
(n = 42, n = 4, n = 16, n = 30 and n = 4 for human, animal, latrine, soil,
water, respectively). d, PCoA of Bray-Curtis distances between resistomes.
Adonis R² = 32.0%, P < 0.001. e, Observed antibiotic resistance proteins.
*P < 0.05, non-parametric Student's t-tests, Bonferroni correction.

f, Percentage of latrine, soil, and water resistomes attributable to human 
faeces, as determined by SourceTracker”’. *P < 0.05, pairwise Wilcox tests, 
Bonferroni correction. g-i, PST human faecal and sewage microbiota and
resistomes, coloured by stage. (n = 44, n = 14, n = 13 and n = 7 for human,
street-access, influent, effluent, respectively). g, PCoA of Bray-Curtis
distances between resistomes. Adonis R² = 34.8%, P < 0.001. h, Observed
antibiotic resistance proteins. *P < 0.05, non-parametric Student's t-tests,
Bonferroni correction. i, Percentage of sewage resistomes attributable
to human faeces at each sewage treatment stage, as determined by
SourceTracker. *P < 0.05, pairwise Wilcox tests, Bonferroni correction.
Error bars, s.d.; centre bars, median. 
No Sun-like dynamo on the active star
ζ Andromedae from starspot asymmetry


R. M. Roettenbacher1, J. D. Monnier1, H. Korhonen2,3, A. N. Aarnio1, F. Baron1,4, X. Che1, R. O. Harmon5, Zs. Kővári6, S. Kraus1,7,
G. H. Schaefer8, G. Torres9, M. Zhao10, T. A. ten Brummelaar8, J. Sturmann8 & L. Sturmann8


Sunspots are cool areas caused by strong surface magnetic fields 
that inhibit convection’. Moreover, strong magnetic fields can 
alter the average atmospheric structure’, degrading our ability to 
measure stellar masses and ages. Stars that are more active than 
the Sun have more and stronger dark spots than does the Sun, 
including on the rotational pole*. Doppler imaging, which has so 
far produced the most detailed images of surface structures on 
other stars, cannot always distinguish the hemisphere in which 
the starspots are located, especially in the equatorial region 
and if the data quality is not optimal®. This leads to problems 
in investigating the north-south distribution of starspot active 
latitudes (those latitudes with more starspot activity); this 
distribution is a crucial constraint of dynamo theory. Polar spots, 
whose existence is inferred from Doppler tomography, could 
plausibly be observational artefacts®. Here we report imaging 
of the old, magnetically active star ζ Andromedae using long-baseline
infrared interferometry. In our data, a dark polar spot
is seen in each of two observation epochs, whereas lower-latitude 
spot structures in both hemispheres do not persist between 
observations, revealing global starspot asymmetries. The north- 
south symmetry of active latitudes observed on the Sun’ is absent 
on ζ And, which hosts global spot patterns that cannot be produced
by solar-type dynamos®. 

ζ And is a nearby active star that is both spatially large and spotted,
making it one of a small number of promising targets for imaging with
current interferometric capabilities. ζ And is a tidally locked close
binary (RS CVn) system consisting of a K-type cool giant star and an 
unseen lower-mass companion star’. Tidal interactions have spun-up 
the cool primary component, causing unusually strong starspots and 
magnetic activity*"”. 

We observed ζ And during two observing campaigns of eleven
nights spanning Universal Time (UT) 9-22 July 2011 and fourteen
nights spanning UT 12-30 September 2013 (see Extended Data
Table 1) with the Michigan InfraRed Combiner (MIRC)! using all 
six telescopes at Georgia State University’s Center for High Angular 
Resolution Astronomy (CHARA) Array!” on Mount Wilson, 
California, USA. 

The 2011 and 2013 data sets were separately imaged onto a prolate 
ellipsoid using the imaging software SURFING (SURFace ImagING), 
an aperture synthesis imaging technique (J.D.M., manuscript in 
preparation). This approach replicates the fundamental ideas behind 
Doppler imaging in that the whole data set is mapped onto the rotat- 
ing surface at once instead of night-by-night snapshots. Treating each 
data set as an ensemble also allows SURFING to fit stellar and orbital 
parameters (see Table 1) along with the surface temperature maps 
(see Figs 1 and 2). 
The surface temperature maps for ζ And show peaks of 4,530 K
and 4,550 K and minimum values of 3,540 K and 3,660 K in 2011 and 
2013, respectively. The ~900-K range of temperatures we see across 
the surface is slightly larger than the ~700-K range found from recent 
Doppler imaging work (from the Fe I λ = 6,430 Å line). A strong dark
polar spot is present in both of our imaging epochs, also consistent
with recent Doppler imaging studies (refs 9, 13 and 14). In contrast to this persistent
feature, many other large dark regions change completely between 
2011 and 2013 with no apparent overall symmetry or pattern. These 
features and their locations can only unambiguously be imaged by 
interferometry, since Doppler and light-curve inversion imaging tech- 
niques experience latitude degeneracies (see Methods section for more 
details). We now discuss the starspot implications on the dynamical 
large-scale magnetic field of ζ And.

The extended network of cool regions stretching across the star 
suggests that strong magnetic fields can suppress convection on global
scales, rather than just local concentrations forming spot structures. 
The extent to which starspots can cover the surface of a star is at pres- 
ent unknown and of interest in understanding how activity saturates 
on rapidly rotating, convective stars'®. The observations in hand 
lend support to studies that have suggested magnetic activity can be 
so widespread as to alter the apparent fundamental parameters of a 
star (refs 16, 17). For example, a larger region of suppressed convection gives
a lower observed temperature, leading to inaccurate estimates for 
stellar mass and age®. The changes in global magnetic features will 
produce long-term photometric variations that are often attributed 
only to changes in a growing or shrinking polar starspot. We note 
that a polar starspot for ζ And does not affect the flux of the star as
much as other large-scale magnetic structures do, owing to the effects 


Table 1 | Parameters of ζ And

Measured parameter                      Value
Angular polar diameter, θLD (mas)       2.502 ± 0.008
Polar radius (R⊙)                       15.0 ± 0.8
Oblateness (major to polar axis)        1.060 ± 0.011
Inclination, i (°)                      70.0 ± 2.8
Pole position angle (°, E of N)         126.0 ± 1.9

Values from the literature
Distance, d (pc)                        17.98 ± 0.83 (ref. 29)
Effective temperature, Teff (K)         4600 ± 100 (ref. 9)
Luminosity, log L/L⊙                    1.98 ± 0.04 (ref. 9)
Primary mass (M⊙)                       2.6 ± 0.4 (ref. 9)
Secondary mass (M⊙)                     ~0.75 (ref. 9)
Iron metallicity [Fe/H]/[Fe/H]⊙         −0.30 ± 0.05 (ref. 9)

SURFING models assumed a circular orbit (eccentricity e = 0) using circular radial velocity
conventions with an orbital period Porb = 17.7694260 ± 0.00004 days and time of nodal passage
T0 = 49992.281 ± 0.017 (MJD)?°. Limb darkening was held fixed with power-law exponent
μ = 0.269, appropriate for ζ And based upon spectral type.
Atlanta, Georgia 30303, USA. 5Department of Physics and Astronomy, Ohio Wesleyan University, Delaware, Ohio 48103, USA. 6Konkoly Observatory, Research Center for Astronomy and Earth
Sciences, Hungarian Academy of Sciences, H-1121 Budapest, Konkoly Thege Miklós út 15-17, Hungary. 7School of Physics, University of Exeter, Exeter, EX4 4QL, UK. 8Center for High Angular
Resolution Astronomy, Georgia State University, Mount Wilson, California 91023, USA. 9Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cambridge, Massachusetts 02138, USA.
10Department of Astronomy and Astrophysics, Pennsylvania State University, State College, Pennsylvania 16802, USA.
Figure 1 | Surface image of ζ And from July 2011 with eleven nights
of data using SURFING. a, The temperature of ζ And is presented in an
Aitoff projection. The contours represent every 200 K from 3,400 K to 
4,600 K. The dashed line at the bottom pole indicates the latitudes below, 
which are hidden owing to the inclination. Arrows point to example 


of limb darkening and foreshortening on this highly inclined system 
(i~70.0°). 

The interferometric images of ζ And provide a clear confirmation
of the existence of polar spots. Polar spots have been seen in Doppler
images of ζ And (refs 9, 13 and 14) and of many other active stars*.
Polar spots produce spectral line-profile changes only in the line core 
itself (no Doppler shift), and the spectral signature of a symmetric 
polar spot is the same at each rotational phase of the star. This signa- 
ture means that polar spots can very easily be produced as artefacts in 
the Doppler imaging process; if the depth of the spectral line profile is 
not correctly modelled, then the image will exhibit a polar spot. Strong 
chromospheric activity has also been postulated to fill in at least some 
of the photospheric lines used in Doppler imaging, potentially pro- 
ducing a polar spot!®'°. These facts made the existence of polar spots 
a matter of debate in the early days of Doppler imaging”*”! and this is 
now independent confirmation of their existence, settling the debate. 

The interferometric images of ζ And presented here reveal the exact
hemispheres of the spots and show strong asymmetries between the 
hemispheres, with the 2011 map showing dominant spots on the 
northern and the 2013 map showing them on the southern hemi- 
sphere. On the Sun the spots are typically seen on both hemispheres 
in certain active mid-latitude regions, with some breaking of the sym- 
metry in the spot numbers on the two hemispheres’ but not as large an 
asymmetry as seen on ζ And. Such asymmetries require a departure
from the solar-type αΩ-dynamo to a more complicated dynamo for
ζ And, such as one with mixed parity modes”.

Although our results strictly apply only for giant stars in RS CVn 
binaries, we note strong parallels between the physical conditions and 
transient starspots. b, The surface reflects how the star is observed on
the sky with H-band intensities (mean H = 1.64). The phase φ = 0.000
plot shows longitude 0° at the bottom right of the star with 90° across the 
middle. The phases assume circular orbit radial velocity conventions. The 
images are oriented with north up and east to the left. 


magnetic behaviour of these and pre-main-sequence stars. To reach 
these conditions, the giant primary stars in RS CVn binaries rotate 
rapidly owing to tidal spin-up and pre-main-sequence stars rotate 
rapidly owing to contraction and angular momentum transfer caused 
by accretion of material from a circumstellar disk. These similar phys- 
ical conditions hint at shared field-generation mechanisms that are 
observationally indistinguishable”* and manifest as starspots. In young 
associations, it has been noted that derived ages are probably strongly 
affected by global suppression of convection’. These commonalities 
and the known consequences argue that strong stellar magnetism must 
be accounted for in models for both pre-main-sequence stars and for 
the stages of evolution in the most active giant stars. 

Results from imaging studies using light-curve inversion and 
Doppler imaging techniques, as well as new interferometric spot 
studies”, all reinforce the picture that global magnetic structures
cover the faces of the most active stars. Our interferometric imag- 
ing has found unambiguous signposts of these structures and clearly 
points to a perspective beyond the typical isolated spots observed on 
the Sun. The large-scale suppression of convection by these global 
magnetic fields will have structural effects on the stellar atmosphere, 
including puffing up the star and decreasing the effective temperature 
and luminosity, dramatic alterations that must be accounted for by 
modern stellar structure calculations especially for young, low-mass 
stars that universally show strong magnetic activity>”>. 

To understand these structural effects, we must image more targets 
with as much detail as possible. The procedures used here can pro- 
vide similar H-band images for a handful of bright, spatially large, 
spotted stars (such as σ Geminorum and λ Andromedae). Impending
Figure 2 | Surface images of ζ And from September 2013 with fourteen
nights of data using SURFING. a and b are presented as in Fig. 1, except 
that the 200-K contours of the Aitoff projection (a) range from 3,600 K to 
4,600 K. The polar spot is observed to have evolved between the two sets of 


advances in visible interferometry will allow for similar resolution on 
more stars (down to 61.1 mas). For stars that cannot be resolved in 
detail, combining interferometrically observed photocentre shifts due 
to rotation of starspots in and out of view with Doppler imaging would 
resolve the degeneracies inherent in the Doppler images, allowing for 
more accurate surface maps. By acquiring a number of these maps on 
several stars or a few observation epochs of the same targets, we could 
investigate how the changing magnetic field affects our determinations 
of stellar parameters (including mass and age)*”®. In addition, the 
development of new dynamo models would shed light on the impact 
of magnetism on stellar evolution?””®. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS


Starspot imaging methods. Large spots covering a substantial fraction of the 
stellar surface have been indirectly imaged using light-curve inversion and 
Doppler imaging techniques*!”. Light-curve inversion reproduces spotted 
stellar surfaces based on time-series data and can only reproduce structures 
observed as rotational modulations—structures such as static polar spots are 
practically invisible to light-curve inversion imaging. Light-curve inversion typ- 
ically reveals only weak relative latitude information for the spots, although this 
can be improved with the combination of concurrent observations in multiple 
filters**. A more detailed surface map, both in latitude and longitude, can be
obtained with Doppler imaging, which creates surface temperature maps from 
tracking small changes in absorption lines as starspots rotate in and out of view”. 
Nevertheless, this method cannot always distinguish the hemisphere in which 
the structures are located. To confirm important findings from these methods 
and to understand the global characteristics of activity that can alter stellar radii 
and effective temperature, a more direct imaging method is required that is 
immune to these ambiguities. 

The nearest magnetically active stars are too small to be resolved by even 

our largest telescopes. However, long-baseline interferometry has the potential 
to image sub-milliarcsecond features on the surfaces of nearby stars. To date, 
interferometric imaging has been used successfully to confirm the oblateness 
and gravity darkening of rapidly rotating stars** and even to image a spotted 
stellar surface”*. To improve the resolution and imaging quality for rotating stars 
that show strong magnetic activity, we present here an ‘imaging-on-a-sphere’ 
technique that uses interferometric observations from multiple nights to constrain
a surface temperature map. This naturally takes advantage of the multiple
views we have of starspot structures as they rotate across the disk of the star. 
Interferometric imaging produces unique images that resolve the degeneracies 
in latitude of Doppler imaging and provides the first independent confirmation 
of the existence of polar starspots. 
Interferometric data. Our interferometric observations were obtained using the 
Michigan InfraRed Combiner (MIRC) at the Center for High Angular Resolution 
Astronomy (CHARA) Array. The CHARA Array consists of six 1-m telescopes 
in a Y-shaped configuration with baselines ranging from 34m to 331m (ref. 12). 
For the ζ And data, MIRC combined light from all six CHARA Array telescopes
in the H-band (eight channels across 1.5–1.8 μm for λ/Δλ ≈ 40), resulting in an
angular resolution of λ/2B ≈ 0.5 milliarcseconds, where B is the longest baseline
used by the interferometer, that is, the distance between the two most widely
separated telescopes of the interferometer.
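
The quoted resolution can be checked with a few lines of arithmetic. The sketch below assumes a band-centre wavelength of 1.65 μm (the midpoint of the 1.5–1.8 μm range, not a value stated in the text) and uses the 331-m longest baseline; the function name is illustrative only.

```python
import math

# Milliarcseconds per radian.
MAS_PER_RAD = math.degrees(1.0) * 3600.0 * 1000.0

def angular_resolution_mas(wavelength_m: float, baseline_m: float) -> float:
    """Return the lambda / (2B) resolution in milliarcseconds."""
    return wavelength_m / (2.0 * baseline_m) * MAS_PER_RAD

# Assumed band-centre wavelength for the 1.5-1.8 micron H-band channels.
H_BAND_CENTRE_M = 1.65e-6
# Longest CHARA baseline quoted in the text.
LONGEST_BASELINE_M = 331.0

resolution = angular_resolution_mas(H_BAND_CENTRE_M, LONGEST_BASELINE_M)
print(f"lambda/2B = {resolution:.2f} mas")
```

Under these assumptions the result is close to the 0.5 milliarcseconds given above.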

The data were reduced and calibrated with the standard MIRC pipeline.
We searched without success for evidence of the faint companion in our interferometric
data using our proven grid search method, and could secure only a 1σ
lower limit of 300:1 on the H-band flux ratio between primary and companion.

The data products obtained from reducing the CHARA/MIRC data with the
standard pipelines consist of visibilities, closure phases and triple amplitudes.
Representative samples of the visibilities and closure phases are presented
in Extended Data Figs 1 and 2 for a single night (UT 15 September 2013) of
six-telescope CHARA/MIRC observations of ζ And. The reduced data are available
in OI-FITS format upon request.
Calibration stars. The twenty-five nights of interferometric data span 9-22 July 
2011 and 12-30 September 2013. For these nights of observation we use four 
calibration stars (37 Andromedae, γ Pegasi, γ Trianguli and 58 Ophiuchi) interspersed
with observations of the target star ζ And. γ Peg, γ Tri, and 58 Oph are
modelled as spherical, uniform disk stars with their parameters as in Extended
Data Table 2. 

The calibrator 37 And is a recently discovered binary system with a
primary-to-secondary H-band flux ratio of 80 ± 20. Ordinarily, binary stars make poor
calibrators, but the 37 And system was observed enough times to determine
its orbit precisely and salvage its use for calibrating our primary target ζ And.
We detect the companion of 37 And in nineteen nights of data (see Extended 
Data Table 3) using a grid search for the companion. To constrain orbital param- 
eters, we combined the visual orbit with the primary star’s radial velocity curve 
obtained with archival spectra from the ELODIE high-resolution échelle spectrograph
formerly on a 1.93-m telescope at Observatoire de Haute-Provence,
France. Extended Data Figs 3 and 4 show the system orbit and radial velocity
curve and Extended Data Table 4 contains the system orbital parameters. The 
orbital parameters are used in the MIRC calibration pipeline to account for the 
effect of the companion of 37 And. 
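
As an illustration of how such orbital parameters translate into a companion position for a given night, the sketch below propagates a Keplerian orbit using the central values of Extended Data Table 4. It is not the MIRC pipeline code. It assumes the standard visual-binary projection and that the tabulated argument of periastron refers to the primary (as in a radial velocity solution), so 180° is added to obtain the companion's orbit; all function names are hypothetical.

```python
import math

def solve_kepler(mean_anomaly: float, ecc: float, tol: float = 1e-12) -> float:
    """Solve E - e*sin(E) = M for the eccentric anomaly E by Newton iteration."""
    m = mean_anomaly % (2.0 * math.pi)
    e_anom = m if ecc < 0.8 else math.pi  # common starting guess for high eccentricity
    for _ in range(100):
        step = (e_anom - ecc * math.sin(e_anom) - m) / (1.0 - ecc * math.cos(e_anom))
        e_anom -= step
        if abs(step) < tol:
            break
    return e_anom

def companion_position(mjd, a_mas, ecc, inc_deg, omega_deg, node_deg, period_d, t_peri_mjd):
    """Return (separation in mas, position angle in degrees east of north)."""
    mean_anom = 2.0 * math.pi * (mjd - t_peri_mjd) / period_d
    e_anom = solve_kepler(mean_anom, ecc)
    r = a_mas * (1.0 - ecc * math.cos(e_anom))
    # True anomaly from the eccentric anomaly.
    nu = 2.0 * math.atan2(math.sqrt(1.0 + ecc) * math.sin(e_anom / 2.0),
                          math.sqrt(1.0 - ecc) * math.cos(e_anom / 2.0))
    u = nu + math.radians(omega_deg)
    node = math.radians(node_deg)
    cos_i = math.cos(math.radians(inc_deg))
    north = r * (math.cos(u) * math.cos(node) - math.sin(u) * math.sin(node) * cos_i)
    east = r * (math.cos(u) * math.sin(node) + math.sin(u) * math.cos(node) * cos_i)
    return math.hypot(north, east), math.degrees(math.atan2(east, north)) % 360.0

# Central values from Extended Data Table 4; omega shifted by 180 degrees (assumption).
sep_mas, pa_deg = companion_position(
    mjd=56547.470, a_mas=46.66, ecc=0.8405, inc_deg=52.5,
    omega_deg=168.9 + 180.0, node_deg=-17.6, period_d=550.7, t_peri_mjd=55765.45)
print(f"separation = {sep_mas:.1f} mas, position angle = {pa_deg:.1f} deg")
```

For UT 2013 Sep 12 this lands within the measurement errors of the values listed in Extended Data Table 3 (82.28 mas at 152.6°), which supports the convention assumed here.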

SURFING imaging code. The image reconstruction code SURFING (SURFace 
imagING) was specially written for this project: to image surfaces of rotating 
stars. We create a global model of the star, including geometrical parameters 
(polar radius, oblateness, inclination, pole position angle, limb darkening
coefficient, rotational period, epoch) as well as the surface temperature map.
We cover the surface with tiles of equal area using the HEALPix methodology,
using 768 tiles to match the spatial resolution of CHARA. Each tile has an area
of 0.025 mas². We represent the shape of the stellar photosphere as a prolate
spheroid to approximate a slightly filled Roche potential. The deviation from 
spherical appears as gravity darkening and accounts for a temperature differ- 
ence of only ~60K, which is much smaller than the temperature variations of 
the starspots. 
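
The tile count follows from the HEALPix rule that a sphere is divided into 12·N_side² equal-area tiles. The sketch below assumes N_side = 8 (which reproduces the 768 tiles quoted, but is not stated in the text) and treats the star as a sphere to estimate the angular radius implied by the quoted tile area; it does not use the HEALPix library itself.

```python
import math

def healpix_tile_count(nside: int) -> int:
    """HEALPix divides the sphere into 12 * nside**2 equal-area tiles."""
    return 12 * nside * nside

def implied_angular_radius_mas(n_tiles: int, tile_area_mas2: float) -> float:
    """Radius of a sphere whose surface area equals n_tiles * tile_area."""
    return math.sqrt(n_tiles * tile_area_mas2 / (4.0 * math.pi))

NSIDE = 8  # assumed resolution parameter; 12 * 8**2 gives the 768 tiles quoted
n_tiles = healpix_tile_count(NSIDE)
radius_mas = implied_angular_radius_mas(n_tiles, 0.025)
print(n_tiles, round(radius_mas, 2))
```

Under the spherical assumption, 768 tiles of 0.025 mas² correspond to an angular radius of roughly 1.2 mas.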

We sampled the large range of geometrical models using an affine invariant
ensemble Markov chain Monte Carlo approach, with a nested loop to iteratively
optimize the surface temperature map within each walker of the outer loop (this
was needed because the 768 free parameters needed to characterize the temperature
map would be intractable using a Markov chain approach). Extensive testing
was carried out using blind simulated data to optimize the speed of convergence.
In addition to minimizing the χ² statistic, we also incorporated priors on each
parameter and could experiment with a variety of imaging regularizers, such as
total variation or the L2 norm of wavelet coefficients.
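
To make the role of a regularizer concrete, the sketch below evaluates a penalized objective of the form χ² + μ·TV on a toy four-tile map. It is not the SURFING implementation: the identity forward model, the ring-shaped neighbour list, the weight `mu` and all function names are assumptions chosen for illustration.

```python
import numpy as np

def chi_squared(model, data, sigma):
    """Sum of squared, noise-normalized residuals."""
    return float(np.sum(((data - model) / sigma) ** 2))

def total_variation(temps, neighbours):
    """Sum of absolute temperature differences over adjacent tile pairs."""
    i, j = np.asarray(neighbours).T
    return float(np.sum(np.abs(temps[i] - temps[j])))

def objective(temps, forward, data, sigma, neighbours, mu):
    """Penalized objective: data misfit plus mu times the regularizer."""
    return chi_squared(forward(temps), data, sigma) + mu * total_variation(temps, neighbours)

# Toy example: four tiles on a ring, one of them a cool spot.
temps = np.array([4500.0, 4500.0, 3900.0, 4500.0])
data = np.array([4510.0, 4490.0, 3900.0, 4500.0])   # hypothetical observables
sigma = 10.0
ring = [(0, 1), (1, 2), (2, 3), (3, 0)]              # hypothetical adjacency
value = objective(temps, lambda t: t, data, sigma, ring, mu=0.01)
print(value)
```

In a real reconstruction the forward model would map tile temperatures to visibilities and closure phases, and the neighbour list would come from the tiling of the sphere.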

In addition to testing the code on simulated data, we were also able to check 
results using soon-to-be-published data on the spotted star λ And (ref. 24), finding
comparable results and confirming the inclination and position angle of the
pole. We also checked that the imaging-derived orientation of another RS CVn
(σ Gem; R.M.R., J.D.M., H.K., R.O.H., F.B., X.C., G. W. Henry, G.H.S., M. Weber
& K. G. Strassmeier, manuscript in preparation) matched the orbital plane of the 
close companion. The results of these tests and additional details about the imple- 
mentation of our method will be described in a future paper (J.D.M., manuscript 
in preparation). 
ζ And parameters from SURFING. Another test of the robustness of our
fitting methodology is to determine the stellar orientation for independent
data sets and compare the results. While the magnetic field structures will
vary from year to year, the inclination of the pole and its position angle on the
sky will not. Extended Data Fig. 5 shows a χ² surface for three separate years:
a 2008 pilot set of observations using the MIRC four-telescope system, the 2011
data set, and the 2013 data set. This figure shows two large regions of reduced
χ² around inclination i = 90° and position angle (PA) = −60° or 120°. These
regions reflect the oblateness of the star and the basic orientation on the sky.
The region near inclination i = 70°, PA = 120° has the lowest χ² in all years, and
especially in 2013. This tells us that the spots move from the southwest towards 
the northeast and not the other way around, and the consistent picture from 
year to year both shows the efficacy of the code and gives confidence that we are 
measuring the true astrophysical signal and not over-fitting noise or systematic 
errors. 

The results from these grid studies have been used to robustly estimate the 
geometrical parameters for ζ And, and these parameters are found in Table 1.
The error bars associated with the parameters were determined by combining the 
results of data sets from 2011 and 2013, with the additional data from 2008. The 
2008 data were obtained with only four CHARA telescopes, so they are not of high 
enough quality for imaging. 

To convert H-band intensities from the reconstructed images into photospheric
temperatures, we used Kurucz atmospheric models for iron metallicity
[Fe/H] = −0.25 and appropriate surface gravity (log g). Note that the overall temperature
scale in our maps is uncertain (overall multiplicative scaling) owing to
lack of coeval photometry at H band; here we adopted a mean H-band magnitude 
of H= 1.64, based on archival infrared photometry. 
ζ And imaging tests. The previous section laid out our robust method for determining
the geometrical parameters of ζ And by using three independent observing
data sets from different years. The next issue is to determine the reliability of
the surface temperature maps. We split the extensive CHARA/MIRC data from 
2013 into two sets of seven nights each, alternating chronologically which night 
went to which set. Since spots take many days to cross the face of the star, each 
data set should be viewing the same spots even though the timing was different. 
We present a comparison of the SURFING results in Extended Data Fig. 6. Both 
partial data sets reproduce the main features observed by the full data set shown 
in the main text. This proves that the features seen in the maps are real and are 
not the result of an over-fitting to poor-quality data or peculiarities of the nightly 
baseline coverage. 
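
The alternating split described above can be sketched as follows. The list of nights is taken from the September 2013 entries of Extended Data Table 1; which subset a given night actually went to is an assumption, since the text only states that the assignment alternated chronologically.

```python
# UT dates (day of September 2013) of the 14 nights in Extended Data Table 1.
nights_2013 = [12, 13, 15, 16, 17, 18, 19, 20, 21, 23, 24, 28, 29, 30]

# Alternate chronologically: every other night goes to each subset.
set_a = nights_2013[0::2]
set_b = nights_2013[1::2]

print("set A:", set_a)
print("set B:", set_b)
```

Each subset then spans nearly the full observing run, so both sample the same spots at interleaved rotational phases.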

The unprecedented phase coverage in the 2011 and (especially) the 2013 observing
runs has allowed for textbook imaging fidelity tests for ζ And. We show that
the large-scale dark regions that cover ζ And are highly robust, probably deriving
from magnetically suppressed convection. 

Code availability. At present, we have opted not to make the SURFING code 
available because of a publication in preparation, which will detail the use and the 
applicability of the resource. 
Extended Data Figure 1 | Visibility curve of UT 15 September 2013 observations of ζ And with CHARA/MIRC. The observed visibilities (unitless) are
plotted in black with 1σ error bars and the SURFING model visibilities are overlaid in red.

Extended Data Figure 2 | Closure phases of UT 15 September 2013 observations of ζ And with CHARA/MIRC. Each block represents a temporal
block of closure phase (CP) observations with data plotted in black (with 1σ errors) and SURFING model in red. Each row represents the CP from a
unique set of three telescopes, labelled in the standard CHARA format.

Extended Data Figure 3 | Orbit of 37 And. The grey plus signs represent measurements of the companion (1σ errors on detections are smaller than the
symbols). The observed resolved disk of 37 And is plotted as the black dot at the origin. The thin solid black line is the best-fit orbit from combining the
interferometric detections and the ELODIE radial velocities. Note that the axis units are milliarcseconds (mas) with north up and east to the left.

Extended Data Figure 4 | Radial velocity curve of the primary star of 37 And. The data points are based upon archival ELODIE spectra (with 1σ error
bars). The orbital solution used the velocity measurements and the interferometric measurements simultaneously. The solid line is the best-fit orbit and
the grey lines are fifty Monte Carlo realizations of the orbit.

Extended Data Figure 5 | Reduced, normalized χ² surface plot of the SURFING model fitted for position angle and inclination of ζ And. The peak
is found at PA = 126.0° ± 1.9 and i = 70.0° ±

Extended Data Figure 6 | Comparison of the 2013 CHARA/MIRC data set divided into two sets of seven nights of data. These projections are plotted
in the same way as the Aitoff projection in Fig. 2.

Extended Data Table 1 | Observations and calibrators of ζ And

UT Date        Modified Julian Date (MJD)    Calibrators Used
2011 Jul 9     55751.536                     37 And
2011 Jul 10    55752.031                     γ Peg
2011 Jul 11    55753.480                     37 And
2011 Jul 12    55754.469                     γ Peg
2011 Jul 14    55756.478                     γ Peg
2011 Jul 16    55758.505                     58 Oph
2011 Jul 17    55759.481                     γ Peg
2011 Jul 19    55761.475                     37 And, γ Peg
2011 Jul 20    55762.517                     γ Peg
2011 Jul 21    55763.478                     γ Peg, γ Tri
2011 Jul 22    55764.480                     γ Peg, γ Tri
2013 Sep 12    56547.449                     37 And, γ Tri
2013 Sep 13    56548.426                     37 And, γ Tri
2013 Sep 15    56550.392                     37 And, γ Peg, γ Tri
2013 Sep 16    56551.365                     37 And, γ Peg, γ Tri
2013 Sep 17    56552.345                     37 And, γ Tri
2013 Sep 18    56553.359                     37 And, γ Peg, γ Tri
2013 Sep 19    56554.407                     37 And, γ Peg, γ Tri
2013 Sep 20    56555.365                     37 And, γ Peg, γ Tri
2013 Sep 21    56556.403                     37 And, γ Tri
2013 Sep 23    56558.403                     37 And, γ Tri
2013 Sep 24    56559.343                     37 And, γ Peg, γ Tri
2013 Sep 28    56563.367                     37 And, γ Tri
2013 Sep 29    56564.357                     37 And, γ Tri
2013 Sep 30    56565.334                     37 And, γ Tri

Extended Data Table 2 | Uniform disk sizes (H-band) of calibrators

Star Name (HD number)    θ_UD (mas)
γ Peg (HD 886)           0.41 ± 0.03
γ Tri (HD 14055)         0.51 ± 0.03
58 Oph (HD 160915)       0.68 ± 0.05

The uniform disk diameters θ_UD were obtained with SearchCal.

Extended Data Table 3 | Binary separation and position angle measurements of 37 And

UT Date        MJD          Separation    Position Angle    Error Ellipse        Error Ellipse
                            (mas)         (°, E of N)       Major Axis (mas)     Minor Axis (mas)
2011 Jul 9     55751.492    11.46         217.0             0.13                 0.11
2011 Jul 12    55754.499    9.09          230.6             0.10                 0.08
2011 Jul 19    55761.465    6.38          295.4             0.06                 0.05
2011 Nov 29    55894.159    63.57         143.9             1.82                 0.16
2013 Sep 12    56547.470    82.28         152.6             0.58                 0.15
2013 Sep 13    56548.399    82.51         152.7             0.02                 0.10
2013 Sep 15    56550.340    82.47         152.8             0.81                 0.26
2013 Sep 16    56551.344    82.73         152.9             1.23                 0.27
2013 Sep 17    56552.329    82.63         152.9             0.48                 0.17
2013 Sep 18    56553.250    82.88         153.0             0.95                 0.21
2013 Sep 19    56554.292    83.06         153.1             0.72                 0.27
2013 Sep 20    56555.301    83.02         153.2             0.79                 0.14
2013 Sep 23    56558.416    83.08         153.4             1.01                 0.22
2013 Sep 24    56559.239    83.29         153.5             0.58                 0.23
2013 Sep 28    56563.303    83.62         153.7             1.08                 0.21
2013 Sep 30    56565.276    83.65         153.8             0.83                 0.20
2014 Aug 19    56888.422    14.45         98.2              0.32                 0.18
2014 Aug 20    56889.446    15.58         100.8             0.24                 0.17
2014 Sep 1     56901.412    22.82         116.9             0.11                 0.07

Extended Data Table 4 | Orbital parameters of 37 And

Parameter                                   Value
Semimajor axis, a (mas)                     46.66 ± 0.06
Eccentricity, e                             0.8405 ± 0.0009
Inclination, i (°)                          52.5 ± 0.3
Argument of periastron, ω (°)               168.9 ± 0.3
Ascending node, Ω (°)                       −17.6 ± 0.2
Orbital period, P_orb (days)                550.7 ± 0.2
Time of periastron passage, T_p (MJD)       55765.45 ± 0.04
Velocity semi-amplitude, K (km/s)           11.1 ± 0.5
System velocity, γ (km/s)                   5.33 ± 0.07

The orbital parameters were obtained by combining the ELODIE radial velocity curve with the CHARA/MIRC detections.

Temperate Earth-sized planets transiting a nearby ultracool dwarf star

Michaël Gillon, Emmanuël Jehin, Susan M. Lederer, Laetitia Delrez, Julien de Wit, Artem Burdanov, Valérie Van Grootel,
Adam J. Burgasser, Amaury H. M. J. Triaud, Cyrielle Opitom, Brice-Olivier Demory, Devendra K. Sahu,
Daniella Bardalez Gagliuffi, Pierre Magain & Didier Queloz


Star-like objects with effective temperatures of less than 2,700 kelvin
are referred to as ‘ultracool dwarfs’. This heterogeneous group
includes stars of extremely low mass as well as brown dwarfs
(substellar objects not massive enough to sustain hydrogen fusion),
and represents about 15 per cent of the population of astronomical
objects near the Sun. Core-accretion theory predicts that, given
the small masses of these ultracool dwarfs, and the small sizes of
their protoplanetary disks, there should be a large but hitherto
undetected population of terrestrial planets orbiting them,
ranging from metal-rich Mercury-sized planets to more hospitable
volatile-rich Earth-sized planets. Here we report observations of
three short-period Earth-sized planets transiting an ultracool dwarf
star only 12 parsecs away. The inner two planets receive four times
and two times the irradiation of Earth, respectively, placing them
close to the inner edge of the habitable zone of the star. Our data
suggest that 11 orbits remain possible for the third planet, the most 
likely resulting in irradiation significantly less than that received by 
Earth. The infrared brightness of the host star, combined with its 
Jupiter-like size, offers the possibility of thoroughly characterizing 
the components of this nearby planetary system. 

TRAPPIST (the TRansiting Planets and PlanetesImals Small
Telescope) monitored the brightness of the star TRAPPIST-1 (2MASS
J23062928 − 0502285) in the very-near infrared (roughly 0.9 μm) at high
cadence (approximately 1.2 minutes) for 245 hours over 62 nights from 
17 September to 28 December 2015. The resulting light curves show 11 
clear transit-like signatures with amplitudes close to 1% (Extended Data 
Figs 1, 2). Photometric follow-up observations were carried out in the 
visible range with the Himalayan Chandra 2-metre Telescope (HCT) in 
India, and in the infrared range with the 8-metre Very Large Telescope 
(VLT) in Chile and the 3.8-metre UK Infrared Telescope (UKIRT) in 
Hawaii. These extensive data show that nine of the detected signatures 
can be attributed to two planets, TRAPPIST-1b and TRAPPIST-1c,
transiting the star every 1.51 days and 2.42 days, respectively (Fig. 1a, b). 
We attribute the two additional transit signals to a third transiting 
planet, TRAPPIST-1d, for which 11 orbital periods—from 4.5 days 
to 72.8 days—are possible on the basis of non-continuous observa- 
tions (Table 1). We cannot discard the possibility that the two transits 
attributed to planet TRAPPIST-1d originate instead from two different 
planets, but the consistency of their main parameters (duration, depth 
and impact parameter) as derived from their individual analyses does 
not favour this scenario. 

TRAPPIST-1 is a well characterized, isolated M8.0 ± 0.5-type dwarf
star at a distance of 12.0 ± 0.4 parsecs from Earth as measured by
its trigonometric parallax, with an age constrained to be more than
500 million years (Myr), and with a luminosity, mass and radius of
0.05%, 8% and 11.5% those of the Sun, respectively. We determined its
metallicity to be solar through the analysis of newly acquired infrared 


spectra. The small size of the host star—only slightly larger than 
Jupiter—translates into Earth-like radii for the three discovered planets, 
as deduced from their transit depths. Table 1 presents the physical 
properties of the system, as derived through a global Bayesian analysis 
of the transit photometry (Fig. 1), including the a priori knowledge 
of its stellar properties, with an adaptive Markov chain Monte Carlo 
(MCMC) code. 
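
The link between transit depth and planetary radius can be illustrated with a short calculation. The sketch below is not the global Bayesian analysis: it simply applies R_p = R★·√depth to the central values listed in Table 1 (depths of 0.754%, 0.672% and 0.826%, and a stellar radius of 0.117 solar radii). The nominal solar and Earth radii are assumed constants, and the function name is illustrative.

```python
import math

R_SUN_KM = 695_700.0   # nominal solar radius (assumed constant)
R_EARTH_KM = 6_378.1   # nominal equatorial Earth radius (assumed constant)

def planet_radius_earth(transit_depth: float, stellar_radius_sun: float) -> float:
    """Planet radius in Earth radii from depth = (Rp / R*)**2."""
    return math.sqrt(transit_depth) * stellar_radius_sun * R_SUN_KM / R_EARTH_KM

STELLAR_RADIUS_SUN = 0.117
depths = {"TRAPPIST-1b": 0.00754, "TRAPPIST-1c": 0.00672, "TRAPPIST-1d": 0.00826}
radii = {name: planet_radius_earth(d, STELLAR_RADIUS_SUN) for name, d in depths.items()}
for name, radius in radii.items():
    print(f"{name}: {radius:.2f} Earth radii")
```

The three values come out close to one Earth radius, in line with the radii reported in Table 1.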

We can discard a non-planetary origin of the transit-like signals 
owing to several factors. The first factor is the high proper motion 
of the star (greater than 1″ per year), which allowed confirmation
(through archival images) that no background source of significant 
brightness was located behind it in 2015. The second factor is that the 
star has no physical companion of stellar-like nature (star or brown 
dwarf), as demonstrated by high-resolution images, radial velocities 
and near-infrared spectroscopy. Together, these factors show that the 
signals do not originate from eclipses of larger bodies in front of a 
background or a physically associated stellar-like object blended with 
the ultracool target star. These factors also establish that the light from 
the target is not diluted by an unresolved additional stellar-like object, 
confirming that the measured transit depths reveal planetary radii of 
terrestrial sizes. Other factors include the significant age of the star,
its moderate activity and rotation period (P_rot = 1.40 ± 0.05 days,
as measured from our photometry), and its low level of photometric
variability (confirmed by our data), all of which are inconsistent with
exotic scenarios based on ultrafast rotation of photospheric structures,
or on occultations by circumstellar material of non-planetary origin
(for example, disk patches or comets).

Further confirmation of the planetary origin of the transits comes 
from, first, the periodicity of the transits of TRAPPIST-1b and 
TRAPPIST-1c, and the achromaticity of the transits of TRAPPIST-1b 
as observed from 0.85 μm (HCT) to 2.09 μm (VLT) (Fig. 1a); and
second, the agreement between the stellar density measured from the
transit light curves, 49.3ρ⊙, with the density inferred from the
stellar properties, (55.3 ± 12.1)ρ⊙ (where ρ⊙ is the density of the Sun).
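
A transit light curve constrains the stellar density through Kepler's third law, ρ★ ≈ 3π(a/R★)³/(GP²), when the orbit is circular and the planet's mass is negligible. The sketch below applies that relation to the period of TRAPPIST-1b and the scale parameter a/R★ = 20.45 from Table 1. It is a consistency check under those assumptions, not the fitting procedure used in the analysis, and the physical constants are assumed nominal values.

```python
import math

G = 6.67430e-11        # m^3 kg^-1 s^-2
M_SUN = 1.98841e30     # kg (assumed nominal value)
R_SUN = 6.957e8        # m  (assumed nominal value)
SECONDS_PER_DAY = 86_400.0

def stellar_density_from_transit(period_days: float, a_over_rstar: float) -> float:
    """Mean stellar density in kg m^-3 for a circular orbit and negligible planet mass."""
    period_s = period_days * SECONDS_PER_DAY
    return 3.0 * math.pi * a_over_rstar ** 3 / (G * period_s ** 2)

rho_sun = M_SUN / (4.0 / 3.0 * math.pi * R_SUN ** 3)
rho_star = stellar_density_from_transit(1.510848, 20.45)
rho_ratio = rho_star / rho_sun
print(f"stellar density = {rho_ratio:.1f} solar densities")
```

The result is about 50 times the solar density, consistent with the density entry in Table 1.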

The masses of the planets, and thus their compositions, remain 
unconstrained by these observations. The results of planetary thermal 
evolution models, and the intense extreme-ultraviolet (1–1,000 Å)
emission of low-mass stars during their early lives, make it unlikely
that such small planets would have thick envelopes of hydrogen and/or
helium gases. Statistical analyses of sub-Neptune-sized planets
detected by the Kepler spacecraft indicate that most Earth-sized planets
in close orbit around solar-type stars are rocky. Nonetheless,
the paucity of material in the inner region of the protoplanetary disk
of an ultracool dwarf would seem to challenge the in situ formation
of rocky planets the size of Earth, favouring instead compositions
dominated by ice-rich material originating from beyond the ice line.
Confirming this hypothesis will require precise mass measurements 

Figure 1 | Transit photometry of the TRAPPIST-1 planets. Each light
curve is phased to the time of inferior conjunction (mid-transit time) of
the object. The light curves are binned in two-minute intervals for planet
TRAPPIST-1b (a), and in five-minute intervals for planets TRAPPIST-1c
(b) and TRAPPIST-1d (c). The best-fit transit models, as derived from
a global analysis of the data, are overplotted (red lines). The light curves
are shifted along the y axis for the sake of clarity. For the HCT/Hanle 
faint object spectrograph camera (HFOSC) light curve, the data are 
unbinned and the error bars are the formal measurement errors. For the 
other light curves, the error bars are the standard errors of the mean of 
the measurements in the bin. WFCAM, wide-field infrared camera on the 
UKIRT; HAWK-I, high acuity wide field K-band imager on the VLT. 


so as to break the degeneracy between the relative amounts of iron, 
silicates and ice. This should be made possible by next-generation,
high-precision infrared velocimeters able to measure the low-amplitude
Doppler signatures (of one-half to a few metres per second) of the
planets. Alternatively, the planets’ masses could be constrained by
measuring the transit timing variations (TTVs) caused by their


222 | NATURE | VOL 533 | 12 MAY 2016 


[Figure 2 plot. Horizontal axis: equilibrium temperature (K). Labelled objects include EPIC 206318379b, K2-9b, GJ 1132b, GJ 1214b and Kepler-42b, c and d.]
Figure 2 | Masses of host stars and equilibrium temperatures of known 
sub-Neptune-sized exoplanets. The size of the symbols scales linearly 
with the radius of the planet. The background is colour-coded according 
to stellar mass (in units of the Sun’s mass). The TRAPPIST-1 planets are 
at the boundary between planets associated with hydrogen-burning stars 
and planets associated with brown dwarfs. Equilibrium temperatures are 
estimated neglecting atmospheric effects and assuming an Earth-like 
albedo of 0.3. The positions of the Solar System terrestrial planets are 
shown for reference. The range of possible equilibrium temperatures of 
TRAPPIST-1d is represented by a solid bar; the dot indicates the most 
likely temperature. Only the exoplanets with a measured radius equal to or 
smaller than that of GJ 1214b are included. 


mutual gravitational interactions, or by transit transmission
spectroscopy.

Given their short orbital distances, it is likely that the planets are
tidally locked, that is, that their rotations have been synchronized
with their orbits by tidal interactions with the host star. Planets
TRAPPIST-1b and TRAPPIST-1c are not in the host star’s habitable
zone (within 0.024 to 0.049 astronomical units (AU) of the star, as
defined by one-dimensional models that are not adequate for modelling
the highly asymmetric climate of tidally locked planets).
However, they have low enough equilibrium temperatures that they
might have habitable regions, in particular at the western terminators
of their day sides (Fig. 2 and Table 1). The main concern regarding
localized habitability on tidally locked planets relates to the trapping
of atmosphere and/or water on their night sides. Nevertheless,
the relatively large equilibrium temperatures of TRAPPIST-1b and
TRAPPIST-1c would probably prevent such trapping. In contrast,
TRAPPIST-1d orbits within or beyond the habitable zone of the star,
its most likely periods corresponding to semi-major axes of between
0.033 AU and 0.093 AU. We estimate tidal circularization timescales
for TRAPPIST-1d (unlike for the two inner planets) to be more than
1 billion years (see ‘Dynamics of the system’ in Methods). Tidal heating
due to a non-zero orbital eccentricity could thus have a significant
influence on the global energy budget and potential habitability of
this planet.
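
The semi-major axes, irradiations and equilibrium temperatures discussed above follow from simple scaling relations. The sketch below is a consistency check, not the global analysis: it uses Kepler's third law in solar units (neglecting the planet's mass), S = L★/a² in Earth units, and T_eq = [S·S⊕(1 − A)/(4σ)]^(1/4) with no atmospheric effects. Stellar mass and luminosity are the central values from Table 1; the solar constant and the function names are assumptions.

```python
DAYS_PER_YEAR = 365.25
SOLAR_CONSTANT = 1361.0          # W m^-2 at 1 AU (assumed nominal value)
STEFAN_BOLTZMANN = 5.670374e-8   # W m^-2 K^-4

def semi_major_axis_au(period_days: float, stellar_mass_sun: float) -> float:
    """Kepler's third law in solar units: a^3 = M * P^2 (planet mass neglected)."""
    period_years = period_days / DAYS_PER_YEAR
    return (stellar_mass_sun * period_years ** 2) ** (1.0 / 3.0)

def irradiation_earth_units(luminosity_sun: float, a_au: float) -> float:
    """Stellar flux at the planet relative to that received by Earth."""
    return luminosity_sun / a_au ** 2

def equilibrium_temperature_k(irradiation: float, bond_albedo: float) -> float:
    """Equilibrium temperature with full heat redistribution and no greenhouse effect."""
    absorbed = irradiation * SOLAR_CONSTANT * (1.0 - bond_albedo)
    return (absorbed / (4.0 * STEFAN_BOLTZMANN)) ** 0.25

M_STAR, L_STAR = 0.080, 0.000525  # central values from Table 1
a_b = semi_major_axis_au(1.510848, M_STAR)
s_b = irradiation_earth_units(L_STAR, a_b)
teq_b_dark = equilibrium_temperature_k(s_b, 0.0)
teq_b_bright = equilibrium_temperature_k(s_b, 0.75)
print(f"a = {a_b:.4f} AU, S = {s_b:.2f} S_earth, Teq = {teq_b_dark:.0f} K / {teq_b_bright:.0f} K")
```

For TRAPPIST-1b these relations give values close to the semi-major axis, irradiation and equilibrium temperatures listed in Table 1; the same functions can be applied to TRAPPIST-1c with its period of 2.421848 days.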

The planets’ atmospheric properties, and thus their habitability, will 
depend on several unknown factors. These include the planets’ com- 
positions; their formation and dynamical history (their migration and 
tides); the past evolution and present level of the extreme-ultraviolet
stellar flux (probably strong enough in the past, and perhaps even
now, to significantly alter the planets’ atmospheric compositions);

Table 1 | Properties of the TRAPPIST-1 planetary system

Parameter                        Value
Star                             TRAPPIST-1 = 2MASS J23062928 − 0502285
Magnitudes                       V = 18.80 ± 0.08, R = 16.47 ± 0.07, I = 14.0 ± 0.1, J = 11.35 ± 0.02, K = 10.30 ± 0.02
Distance, d★                     12.1 ± 0.4 parsecs (ref. 12)
Luminosity, L★                   (0.000525 ± 0.000036)L⊙ (ref. 13)
Mass, M★                         (0.080 ± 0.009)M⊙
Radius, R★                       (0.117 ± 0.004)R⊙
Density, ρ★                      50.3ρ⊙
Effective temperature, T_eff     2,550 ± 55 K
Metallicity, [Fe/H]              +0.04 ± 0.08 (from near-infrared spectroscopy)
Rotation period, P_rot           1.40 ± 0.05 days (from TRAPPIST photometry)
Age, τ★                          >500 Myr (ref. 13)

Planets                                      TRAPPIST-1b                 TRAPPIST-1c                 TRAPPIST-1d
Orbital period, P                            1.510848 ± 0.000019 days    2.421848 ± 0.000028 days    4.551, 5.200, 8.090, 9.101, 10.401, 12.135, 14.561, 18.202, 24.270, 36.408, 72.820 days*
Mid-transit time, t0 − 2,450,000 (BJD_TDB)   7,322.51765 ± 0.00025       7,362.72618 ± 0.00033       7,294.7741 ± 0.0013†
Transit depth, (R_p/R★)²                     0.754% ± 0.025%             0.672% ± 0.042%             0.826% ± 0.073%†
Transit impact parameter, b                  (0.21 ± 0.14)R★             (0.25 ± 0.15)R★             (0.24 ± 0.15)R★†
Transit duration, W                          36.12 ± 0.46 min            41.78 ± 0.81 min            83.3 ± 2.5 min†
Orbital inclination, i                       89.41 ± 0.41 deg            89.50 ± 0.31 deg            89.87 ± 0.10 deg†
Orbital eccentricity, e                      0 (fixed)                   0 (fixed)                   0 (fixed)
Radius, R_p                                  (1.113 ± 0.044)R⊕           (1.049 ± 0.050)R⊕           (1.168 ± 0.068)R⊕†
Scale parameter, a/R★                        20.45                       28.0                        41–271‡
Semi-major axis, a                           0.01111 ± 0.00040 AU        0.01522 ± 0.00055 AU        0.022–0.146 AU‡
Irradiation, S_p                             (4.25 ± 0.38)S⊕             (2.26 ± 0.21)S⊕             (0.02–1.0)S⊕‡
Equilibrium temperature, T_eq
  with Bond albedo of 0.00                   400 ± 9 K                   342 ± 8 K                   110–280 K‡
  with Bond albedo of 0.75                   285 ± 7 K                   242 ± 6 K                   75–200 K‡

The values and 1σ errors given for the planetary parameters and for the stellar mass (M★), radius (R★), density (ρ★), and effective temperature (T_eff) were deduced from a global analysis of the photometric
data, including a priori knowledge of the stellar properties (see Methods). BJD_TDB, barycentric Julian date in the barycentric dynamical time standard. L⊙, M⊙, R⊙ and ρ⊙ are, respectively, the luminosity,
mass, radius and density of the Sun. R_p and S_p are, respectively, the radius and irradiation of the planet; R⊕ and S⊕ are, respectively, the radius and irradiation of Earth.

*These are the potential orbital periods of TRAPPIST-1d, derived from non-continuous observations. The value in bold type is the most likely value for the period, as derived from the shape of the transits.
†Values calculated on the basis that P = 18.20175 ± 0.00045 days.
‡The ranges allowed by the set of possible periods.


and the past and present amplitudes of atmospheric replenishment 
mechanisms (impacts and volcanism). Fortunately, the TRAPPIST-1 
planets are particularly well suited for detailed atmospheric 
characterization—notably by transmission spectroscopy (Fig. 3)— 
because transit signals are inversely proportional to the square of the 
host-star radius, the latter being only around 12% of that of the Sun for 
TRAPPIST-1. Data obtained by the Hubble Space Telescope should 


[Figure 3 plot. Horizontal axis: equilibrium temperature (K). Labelled objects include TRAPPIST-1b, TRAPPIST-1c, TRAPPIST-1d, GJ 1214b, GJ 1132b, K2-9b, Kepler-42b, c and d, and Kepler-444b and c.]


provide initial constraints on the extent and composition of the planets’ 
atmospheres. The next generation of observatories will then allow far 
more in-depth exploration of the atmospheric properties. In particu- 
lar, data from the James Webb Space Telescope should yield strong 
constraints on atmospheric temperatures and on the abundances of 
molecules with large absorption bands including several potential bio- 
markers such as water, carbon dioxide, methane and ozone. 


Figure 3 | Potential for characterizing the atmospheres of known
transiting sub-Neptune-sized exoplanets. The signal being transmitted
from each planet is estimated in parts per million (p.p.m.) and for
transparent water-dominated atmospheres with a mean molecular
weight, μ, of 19. The signal-to-noise ratio (SNR) in transmission
(normalized to that of GJ 1214b under the same atmospheric assumptions)
is also calculated. The estimated signal and SNR are plotted against
equilibrium temperatures, assuming a Bond albedo of 0.3. The black
horizontal bar indicates the SNR that will require 200 (or 500) [or 1,000]
hours of in-transit observations with the James Webb Space Telescope
to yield a planet’s atmospheric temperature with a relative uncertainty
below 15% and with abundances within a factor of four in the case of a
H2O (or N2) [or CO2]-dominated atmosphere (μ = 19 (or 28) [or 39]).
Only the exoplanets with a measured radius equal to or smaller than that
of GJ 1214b are included in the figure. The size of the circular symbol for
each planet is proportional to the planet's physical size. For illustration,
symbols for planets of one (R⊕) and two (2R⊕) Earth radii are shown at the
top right.
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METHODS 

Spectral type, parallax and age of the star. TRAPPIST-1 = 2MASS J23062928−0502285
was discovered in 2000 by a search for nearby ultracool dwarfs according
to photometric criteria, and identified as a high proper-motion (right ascension
component μ_α = 0.89″, declination component μ_δ = −0.42″), moderately active
(logarithm of Hα to bolometric luminosity ratio log(L_Hα/L_bol) = −4.61), M7.5
dwarf at approximately 11 parsecs from Earth. Subsequent findings converged on
a spectral type of M8.0 ± 0.5 (refs 11, 32), while confirming a moderate level of
activity typical of stars of similar spectral type in the vicinity of the Sun. The
spectral classification was checked by comparing a low-resolution (R ≈ 150)
near-infrared spectrum of the star, obtained with the SpeX spectrograph mounted
on the 3-metre NASA Infrared Telescope Facility, with several spectral-type
standards; the spectrum of TRAPPIST-1 best fit that of the M8-type standard
LHS 132 (Extended Data Fig. 3a). The Cerro Tololo Interamerican Observatory
Parallax Investigation (CTIOPI) project reported the star’s trigonometric parallax
to be π = 82.6 ± 2.6 mas (ref. 12), which translates to a distance of 12.1 ± 0.4 parsecs.
High-resolution optical spectroscopy failed to detect significant absorption
at the 6,708 Å lithium line, suggesting that the object is not a very young brown
dwarf, but rather a very-low-mass main-sequence star. This is in agreement with
its thick disk kinematics, its relatively slow rotation (projected rotational velocity
vsini = 6 ± 2 km s⁻¹), its moderate activity, and its reported photometric stability,
all of which point to an age of at least 500 Myr (ref. 13).
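
The conversion from parallax to distance quoted above can be checked with a few lines of Python. This is only an illustrative sketch: it assumes the simple inversion d = 1/π (π in arcseconds) with first-order error propagation, and the function name is hypothetical rather than part of any software used in this work.

```python
def parallax_to_distance(parallax_mas, sigma_mas):
    """Return (distance, uncertainty) in parsecs for a parallax given in mas."""
    distance_pc = 1000.0 / parallax_mas
    # First-order propagation: sigma_d / d = sigma_parallax / parallax
    sigma_pc = distance_pc * (sigma_mas / parallax_mas)
    return distance_pc, sigma_pc


distance_pc, sigma_pc = parallax_to_distance(82.6, 2.6)
print(f"{distance_pc:.1f} +/- {sigma_pc:.1f} pc")  # 12.1 +/- 0.4 pc
```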

Metallicity of the star. We obtained new, near-infrared (0.9-2.5 μm) spectrographic
data for TRAPPIST-1 with the SpeX spectrograph on the night of 18
November 2015 (universal time), during clear conditions and 0.8″ seeing at
K-band. We used the cross-dispersed mode and 0.3″ × 15″ slit, aligned at the
parallactic angle, to acquire moderate-resolution data (λ/Δλ ~ 2,000) with a
dispersion of 3.6 Å per pixel, covering the spectral range 0.9-2.5 μm in seven orders. Ten
exposures each of 300 seconds were obtained over an air mass ranging from 1.14
to 1.17, followed by observations of the A0V star 67 Aqr (V = 6.41) at an air mass
of 1.19 for telluric and flux calibration, as well as internal lamp exposures. Data
were reduced using the SpeXtool package version 4.04 (refs 37, 38). The reduced
spectrum has a median signal-to-noise ratio of 300 in the 2.17-2.35-μm region (see
Extended Data Fig. 3b; the metallicity-sensitive atomic features of Na I (2.206 μm,
2.209 μm) and Ca I (2.261 μm, 2.263 μm, 2.266 μm) are labelled). We measured
the equivalent widths of these features and the H2O-K2 index (defined in ref. 39),
and used the mid- and late-M-dwarf metallicity calibration of ref. 40 to determine
[Fe/H] = 0.04 ± 0.02 (measurement) ± 0.07 (systematic) for TRAPPIST-1. The
quadratic sum of the two errors resulted in our final value for [Fe/H] of 0.04 ± 0.08.

Basic parameters of the star. A recent study (ref. 13) derived a luminosity for
TRAPPIST-1 of (0.000525 ± 0.000036)L⊙ (where L⊙ is the luminosity of the Sun),
using as input data the trigonometric parallax and VRI magnitudes as measured
by the Cerro Tololo Interamerican Observatory Parallax Investigation (CTIOPI)
project (ref. 12), the 2MASS JHK magnitudes, the Wide-Field Infrared Survey Explorer
(WISE) W123 magnitudes, an optical spectrum measured with the Kitt Peak
National Observatory Ritchey-Chretien spectrograph, and a near-infrared
spectrum measured by SpeX/Prism. Using this luminosity and an age constraint of
>500 Myr, the authors of ref. 13 derived (from evolutionary-model isochrones and
the Stefan-Boltzmann law) the following values for the mass, radius and effective
temperature of TRAPPIST-1: M★ = (0.082 ± 0.009)M⊙, R★ = (0.116 ± 0.004)R⊙, and
Teff = 2,557 ± 64 K. To account for the uncertainties coming from the assumptions
and details of the evolutionary models, we carried out a new determination of these
three basic parameters, using recent solar metallicity evolutionary-model
isochrones that consistently couple atmosphere and interior structures. We obtained
M★ = 0.089M⊙, R★ = 0.112R⊙, and Teff = 2,615 K. We then added the difference
between the two determinations quadratically to the errors of ref. 13, adopting
finally M★ = (0.082 ± 0.011)M⊙, R★ = (0.116 ± 0.006)R⊙, and Teff = 2,555 ± 85 K.
We took the normal distributions corresponding to these values and errors as
prior probability distribution functions in the Bayesian analysis of our photometric
data (see below).
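
The error inflation described above (adding the offset between the two determinations in quadrature to the errors of ref. 13) can be sketched as follows. The helper name is hypothetical and the snippet is an illustration, not the procedure actually coded by the authors. With the numbers quoted in the text it gives 0.011 M⊙ and 0.006 R⊙, and about 86 K for the temperature, close to the 85 K adopted.

```python
import math


def add_in_quadrature(sigma, offset):
    """Combine a quoted 1-sigma error with a model-to-model offset in quadrature."""
    return math.hypot(sigma, offset)


# Arguments: error quoted in ref. 13, then (new isochrone value - ref. 13 value)
mass_err = add_in_quadrature(0.009, 0.089 - 0.082)     # solar masses
radius_err = add_in_quadrature(0.004, 0.112 - 0.116)   # solar radii
teff_err = add_in_quadrature(64.0, 2615.0 - 2557.0)    # kelvin

print(f"sigma_M = {mass_err:.3f}, sigma_R = {radius_err:.3f}, sigma_T = {teff_err:.0f}")
```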

Possible binary nature of the star. High-resolution imaging from the ground
and from space with the Hubble Space Telescope ruled out the existence of a
companion down to an angular distance of 0.1″, corresponding to a projected physical
distance of 1.2 au at 12 parsecs, and in good agreement with the reported stability
of the radial velocity of the star at the ~10 m s⁻¹ level over one week and at the
~150 m s⁻¹ level over about ten weeks. We performed spectral binary template
fitting to the IRTF/SpeX spectroscopy, and statistically reject the presence of an
L- or T-type brown-dwarf companion that would be visible in a blended-light
spectrum. TRAPPIST-1 can thus, in all probability, be considered to be an isolated star.

Upper magnitude limits on a background eclipsing binary. We measured the
J2000 equatorial coordinates of TRAPPIST-1 in the 2015 TRAPPIST images,
using 29 stars from the UCAC2 catalogue and the Pulkovo Observatory
Izmccd astrometric software. We obtained coordinates of right ascension
(RA) = 23 h 06 min 30.34 s and declination (dec.) = −05° 02′ 36.44″. Owing to the
high proper motion of TRAPPIST-1 (~1″ per year), we could assess the possible
presence of a background object by examining this exact position in several
previous images taken from the POSS (1953; ref. 54) and 2MASS (1998; ref. 41) image
catalogues. We detected no possible additional source at this position in any of
these images. The faintest stars detected at other positions in the 2MASS images
have J-band magnitudes of ~17. We adopt this value as an absolute lower threshold
for the J-band magnitude of a background source blended with TRAPPIST-1 in our
TRAPPIST 2015 images. TRAPPIST-1 has a J-band magnitude of 11.35 (ref. 42),
and the achromaticity of the transits of TRAPPIST-1b as observed from 0.85 μm to
2.09 μm means that, if the transits originated from a background eclipsing binary
(BEB), then that BEB would have to be a very red object with a spectral type similar
to that of TRAPPIST-1. Combining these two facts, the BEB scenario would require
an unphysical eclipse depth of more than 100% in the photometric bands probed
by our observations to match the ~0.8% depths measured after dilution by the light
from TRAPPIST-1. We thus firmly discard the BEB scenario.
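
The dilution argument can be made explicit with a short calculation. The sketch below is illustrative only: it assumes that the J-band magnitude difference is representative of the flux ratio in the observed bands (reasonable if the hypothetical binary has a colour similar to TRAPPIST-1), and the function name is not taken from any code used in this work.

```python
def required_eclipse_depth(observed_depth, target_mag, background_mag):
    """Intrinsic eclipse depth a blended background binary would need.

    The binary contributes a fraction f = F_bg / (F_target + F_bg) of the
    blended flux, so an intrinsic depth d is observed as d * f.
    """
    flux_ratio = 10.0 ** (-0.4 * (background_mag - target_mag))  # F_bg / F_target
    blend_fraction = flux_ratio / (1.0 + flux_ratio)
    return observed_depth / blend_fraction


depth = required_eclipse_depth(observed_depth=0.008, target_mag=11.35, background_mag=17.0)
print(f"required intrinsic depth: {100 * depth:.0f}%")
```

With a background source no brighter than J ~ 17, the required intrinsic depth comes out well above 100%, which is the unphysical result invoked in the text.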

Photometric observations and analysis. The TRAPPIST observations in which
the transits were detected consisted of 12,295 exposures, each of 55 seconds,
gathered with a thermoelectrically cooled 2K × 2K CCD camera (field of view of 22′ ×
22′; pixel scale of 0.65″). Most of the observations were obtained through an I+z
filter with a transmittance greater than 90% from 750 nm to beyond 1,100 nm,
the effective bandpass in this spectral range being defined by the response of the
CCD. On the basis of the spectral efficiency model for TRAPPIST and an optical
spectrum of a spectroscopic standard M8V star (VB10), we compute an effective
wavelength of 885 ± 5 nm for these observations. For the nights of 20 November
and 19 December 2015, the target was close to the full Moon and the observations
were performed in the Sloan z′ filter to minimize the background. After a standard
pre-reduction (bias, dark, flat-field correction), the TRAPPIST automatic pipeline
extracted the stellar fluxes from the images using the DAOPHOT aperture
photometry software for eight different apertures. A careful selection of both the
photometric aperture size and the stable comparison stars was then performed
manually to obtain the most accurate differential light curves of the target.

Photometric follow-up observations were performed with the HAWK-I
near-infrared imager on the European Southern Observatory (ESO) 8-metre Very
Large Telescope (Chile), with the HFOSC optical spectro-imager
(http://www.iiap.res.in/iao_hfosc) on the 2-metre Himalayan Chandra Telescope (India), and with
the WFCAM near-infrared camera located at the prime focus of the 3.8-metre
UKIRT telescope (Hawaii).

The VLT/HAWK-I observations of a transit of planet TRAPPIST-1b were
performed during the night of 8 November 2015. HAWK-I is composed of four
Hawaii 2RG 2,048 × 2,048 pixel detectors (pixel scale = 0.106″). Its total field of
view on the sky is 7.5′ × 7.5′. The transit was observed through the narrowband
filter NB2090 (λ = 2.095 μm, width = 0.020 μm). In total, 185 exposures, each
composed of 17 integrations of 1.7 seconds, were acquired during the run in ‘stare’ mode, that
is, without applying a jitter pattern. After standard calibration of the images, stellar
flux measurement was performed by aperture photometry.

The HCT/HFOSC observations of a transit of TRAPPIST-1b were performed
on 18 November 2015. The imaging CCD detector of HFOSC is an array of
2,048 × 2,048 pixels, corresponding to a field of view of 10′ × 10′ on-sky (pixel
scale = 0.3″). The observations consisted of 104 exposures, each of 20 seconds,
taken in stare mode and in the I filter, centred on the expected transit time. After
a standard calibration of these images and their photometric reduction with
DAOPHOT, differential photometry was performed. We estimate the effective
wavelength of these observations to be 840 ± 20 nm, given the spectral response
of HFOSC and an optical spectrum of the M8V standard star VB10.

The UKIRT/WFCAM observations of two transits of planet TRAPPIST-1b and
one transit of planet TRAPPIST-1c consisted of three runs of 4 hours each,
performed on 5, 6 and 8 December 2015 in the J-band. WFCAM is composed of four
HgCdTe detectors of 2,048 × 2,048 pixels each, with a pixel scale of 0.4″, resulting
in a field of view of 13.65′ × 13.65′ for each detector. On 5 December 2015, 1,365
exposures composed of three integrations of 2 seconds each were performed in
stare mode. For the runs on 6 and 8 December 2015, respectively, 1,181 and 1,142
exposures composed of five integrations of 1 second each were performed, again in stare
mode and using the same pointing as on 5 December 2015. Differential aperture
photometry was performed with DAOPHOT on all calibrated images.

Global analysis of the photometry. We inferred the parameters of the three
detected planets transiting TRAPPIST-1 from analysis of their transit light curves
(Extended Data Fig. 1 and Extended Data Table 1) with an adaptive Markov chain
Monte Carlo (MCMC) code. We converted each universal time (UT) of
mid-exposure to the BJD_TDB time system. The model assumed for each light curve was
composed of the eclipse model of ref. 60, multiplied by a baseline model, aiming to
represent the other astrophysical and instrumental mechanisms able to produce
photometric variations. Assuming the same baseline model for all light curves, and
minimizing the Bayesian information criterion (BIC), we selected a second-order
time polynomial as a baseline model to represent the curvature of the light curves
due to the differential extinction and the low-frequency variability of the star, and
added an instrumental model composed of a second-order polynomial function
of the positions and widths of the stellar images.

Stellar metallicity, effective temperature, mass and radius were four free
parameters in the MCMC for which prior probability distribution functions (PDFs) were
selected as input. Here, the normal distributions N(0.04, 0.08²) dex, N(2,555,
85²) K, N(0.082, 0.011²)M⊙, and N(0.116, 0.006²)R⊙ were assumed on the basis
of a priori knowledge of the stellar properties (see the section on ‘Basic
parameters of the star’). Circular orbits were assumed for all transiting objects. For each
of them, the additional free parameters in the MCMC included: (1) the transit
depth, dF, defined as (Rp/R★)², with Rp and R★ being the planetary and stellar radii,
respectively; (2) the transit impact parameter b = a cos i/R★, with a and i being the
planet’s semi-major axis and orbital inclination, respectively; (3) the orbital period
P; (4) the transit width W, defined as (P × R★/a)[(1 + Rp/R★)² − b²]^(1/2)/π; and (5)
the mid-transit time (time of inferior conjunction) T0. Uniform prior distributions
were assumed for each of these free parameters. At each step of the MCMC, values
for Rp, a and i were computed from the values for the transit and stellar parameters;
values were also computed for the irradiation of the planet in Earth units and for
its equilibrium temperatures, assuming Bond albedos of 0 and 0.75, respectively.
A quadratic limb-darkening law was assumed for the star. For each bandpass,
values and errors for the limb-darkening coefficients u1 and u2 were derived
from the tables in ref. 62 (Extended Data Table 2), and the corresponding normal
distributions were used as prior PDFs in the MCMC. u1 and u2 were free parameters
under the control of these PDFs in the MCMC.
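
The step from the fitted transit parameters to the physical ones can be illustrated with a short sketch. It simply inverts the definitions of dF, b and W given above. The function names, the unit-conversion constants, the input values and the equilibrium-temperature formula (which assumes full heat redistribution) are all assumptions made for this example and are not taken from the authors' Fortran code or from Table 1.

```python
import math

R_SUN_IN_EARTH_RADII = 109.1   # approximate ratio, assumed for illustration
R_SUN_IN_AU = 0.00465          # approximate ratio, assumed for illustration


def derive_planet(dF, b, period_d, width_d, r_star_rsun, teff_k, bond_albedo=0.0):
    """Invert the transit parameters (dF, b, P, W) into physical quantities."""
    k = math.sqrt(dF)  # Rp / R*
    # W = (P / pi) * (R*/a) * sqrt((1 + k)^2 - b^2), solved here for a/R*
    a_over_rstar = period_d * math.sqrt((1.0 + k) ** 2 - b ** 2) / (math.pi * width_d)
    inclination_deg = math.degrees(math.acos(b / a_over_rstar))
    # Equilibrium temperature, assuming full heat redistribution
    t_eq = teff_k * math.sqrt(1.0 / (2.0 * a_over_rstar)) * (1.0 - bond_albedo) ** 0.25
    return {
        "rp_rearth": k * r_star_rsun * R_SUN_IN_EARTH_RADII,
        "a_au": a_over_rstar * r_star_rsun * R_SUN_IN_AU,
        "a_over_rstar": a_over_rstar,
        "inclination_deg": inclination_deg,
        "t_eq_k": t_eq,
    }


def transit_width(dF, b, period_d, a_over_rstar):
    """Forward relation for W, used here only to check the inversion."""
    k = math.sqrt(dF)
    return period_d / a_over_rstar * math.sqrt((1.0 + k) ** 2 - b ** 2) / math.pi


# Hypothetical inputs, not the fitted values of this work
example = derive_planet(dF=0.0075, b=0.2, period_d=1.5, width_d=0.025,
                        r_star_rsun=0.116, teff_k=2555.0)
print(example)
```

Sampling in (dF, b, P, W, T0) and deriving Rp, a and i afterwards, as described above, keeps the sampled parameters close to what the light curve constrains directly.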

We divided our analysis into three phases. The first phase focused on the two 
inner planets, for which the period is firmly determined. A circular orbit was 
assumed for both planets. All transit light curves of the two planets were used as 
input data for this first phase, except the TRAPPIST light curve of 11 December 
2015, for which the transit of planet TRAPPIST-1c is blended with a transit of 
planet TRAPPIST-1d. A preliminary MCMC analysis composed of one chain of 
50,000 steps was first performed to estimate the need to rescale the photometric
errors. Then a longer MCMC analysis was performed, composed of five chains
of 100,000 steps, whose convergence was checked using the statistical test of
ref. 63. The parameters derived from this analysis for the star and its two inner
planets are shown in Table 1. We performed a similar analysis assuming a uniform
prior PDF for the stellar radius to derive the value of the stellar density constrained
only by the transit photometry. It resulted in a stellar density of 49.37 3°37, in
excellent agreement with the density of (55.3 ± 12.1)ρ⊙ derived from the a priori
knowledge of the star, thus bringing a further validation of the planetary origin of 
the transit signals. 
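
For reference, the way transit photometry alone constrains the stellar density can be sketched with the standard relation for a circular orbit and a planet of negligible mass, ρ★ = 3π(a/R★)³/(GP²). This relation is assumed here for illustration; the text does not spell out the exact expression used. The example checks the function on the Earth-Sun system rather than on TRAPPIST-1.

```python
import math

G = 6.67430e-11          # gravitational constant, m^3 kg^-1 s^-2
SOLAR_DENSITY = 1410.0   # kg m^-3, approximate mean density of the Sun


def stellar_density(a_over_rstar, period_days):
    """Mean stellar density (kg m^-3) from a/R* and P, neglecting the planet's mass."""
    period_s = period_days * 86400.0
    return 3.0 * math.pi * a_over_rstar ** 3 / (G * period_s ** 2)


# Sanity check with the Earth-Sun system (a/R_sun ~ 215.03, P ~ 365.256 d)
rho_sun = stellar_density(215.03, 365.256)
print(f"{rho_sun / SOLAR_DENSITY:.2f} solar densities")
```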

In the second phase of our analysis, we performed 11 global MCMC analyses 
of all transit light curves, each of them consisting of one chain of 50,000 steps and 
corresponding to one of the possible values of the period of TRAPPIST-1d (see 
Table 1) for which a circular orbit was assumed. We then repeated the 11 analyses 
under the assumption of an eccentric orbit for TRAPPIST-1d. We used the
medians of the BIC posterior distributions to compare the relative posterior probability
of each orbital model through the formula P1/P2 = e^((BIC2 − BIC1)/2). The resulting
relative probabilities are given in Extended Data Table 3. The table shows that our
data favour (with a relative probability of > 10%) a circular orbit and an orbital
period of between 10.4 and 36.4 days, the most likely period being 18.4 days.
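
The model comparison formula above translates directly into code. The sketch below is illustrative: the function names are hypothetical and the BIC values are invented for the example, not taken from Extended Data Table 3.

```python
import math


def relative_probability(bic_1, bic_2):
    """Posterior odds P1/P2 = exp((BIC2 - BIC1) / 2) for two competing models."""
    return math.exp((bic_2 - bic_1) / 2.0)


def normalized_probabilities(bics):
    """Probability of each model relative to the one with the lowest BIC."""
    best = min(bics)
    return [relative_probability(bic, best) for bic in bics]


# Hypothetical median BIC values for three candidate orbital solutions
odds = normalized_probabilities([1012.4, 1008.0, 1015.1])
print([f"{p:.3f}" for p in odds])
```

Normalizing to the lowest-BIC model gives that model a relative probability of 1 and every other model a value below 1.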

In the final phase, we performed individual analyses of the light curves to
measure the mid-eclipse time of each transit to support future transit timing
variation (TTV) studies of the
system. The resulting timings are shown in Extended Data Table 4. They do not
reveal any significant TTV signal, which is not surprising given the amplitude of 
the expected periodicity departures (see below) combined with the limited timing 
precision of the TRAPPIST photometry. 

Extended Data Figs 1 and 2 show the raw and de-trended light curves,
respectively; for each of these, the best-fit eclipse plus baseline model is overplotted. The
phased-time de-trended light curves are shown for each planet and bandpass in
Fig. 1.

Photometric variability of the star. We used the TRAPPIST data set to assess the
photometric variability of the star at about 900 nm. On a timescale of a few hours,
corresponding to the typical duration of our observing runs, the star appears
to be relatively stable, except for the transits and for four sharp, low-amplitude
increases in brightness (of one to a few per cent) that are followed by
exponential-type decreases to normal levels within 10-15 minutes (Extended Data Fig. 4),
which we attribute to flares. The low amplitude and inferred low frequency
(1/60 h⁻¹) of these flares are consistent with the reported low level of activity of the
star, strengthening the inference that the system is not young.


To assess the lower-frequency variability of TRAPPIST-1, we built its global
differential light curve in the I+z filter, using four stable stars of similar brightness
in the TRAPPIST images as comparison stars. We filtered out the flares, transits,
and measurements taken in cloudy conditions to create the resulting light curve,
consisting of 12,081 photometric measurements. Extended Data Fig. 5a compares
this light curve to that for the comparison star 2MASS J23063445−0507511. It
clearly shows some variability at the level of a few per cent, which is consistent
with previous photometric results obtained in the I-band. A Lomb-Scargle (LS)
periodogram analysis of the light curve, from which low-frequency variations and
differential extinction have been filtered out by division by the best-fit fourth-order
polynomial in time and air mass, reveals a power excess with a period of
1.4 days (see Extended Data Fig. 5b). Cutting the light curve in two, and in four
in a second test, and performing an LS analysis of each fraction revealed a power
excess at about 1.4 days for all of them, supporting a genuine periodic signal of
astrophysical origin. Associating it with the stellar rotation period, the resulting
equatorial rotation speed of 4.2 km s⁻¹ (assuming R★ = 0.117R⊙) is consistent with
the literature measurement for vsini of 6 ± 2 km s⁻¹, making this association
physically meaningful. Given the scatter of the peak values obtained in the LS
analyses of the light-curve fractions, we estimate the error bar on the rotation period of
1.40 days to be 0.05 days. In summary, the photometric variability of the star seems
to be dominated by the rotation and evolution of photospheric inhomogeneities
(spots) combined with rare flares.
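
The equatorial rotation speed quoted above follows from v = 2πR★/P. A minimal check, assuming the nominal solar radius of 695,700 km (a constant not stated in the text) and a hypothetical function name:

```python
import math

R_SUN_KM = 695_700.0       # nominal solar radius, assumed for this example
SECONDS_PER_DAY = 86_400.0


def equatorial_velocity_kms(radius_rsun, period_days):
    """Equatorial rotation speed v = 2*pi*R / P, in km/s."""
    return 2.0 * math.pi * radius_rsun * R_SUN_KM / (period_days * SECONDS_PER_DAY)


v_eq = equatorial_velocity_kms(0.117, 1.40)
print(f"{v_eq:.1f} km/s")  # 4.2 km/s
```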

Dynamics of the system. We computed the tidal circularization timescales
of the three planets according to tore = Rp
Mp, ranging from 0.45 Earth masses (pure ice composition) to 3 Earth masses 
(pure iron composition) and a tidal quality factor, Q, of 100, corresponding
to the maximum value derived for terrestrial planets and satellites of the solar
system. For planets TRAPPIST-1b and -1c, the computed values range from
22 Myr to 145 Myr and from 177 Myr to 1.1 Gyr, respectively. Taking into account 
that the system is apparently not very young and that the orbits have weak mutual 
perturbations (as they are not close to any mean-motion resonance), our assump- 
tion of circular orbits for the two inner planets is reasonable. On the other hand, 
the same computations result in values ranging from a few to tens of billions of 
years for TRAPPIST-1d, making a significant orbital eccentricity possible from 
a tidal theory perspective. Nonetheless, a nearly circular orbit for this outer planet 
is still a reasonable hypothesis when considering the strong anticorrelation of 
orbital eccentricity and multiplicity of planets detected by radial velocities, and
is favoured by our global analysis of the transit photometry (see above). 

We used the Mercury software package to assess the dynamical stability of
the system over 10,000 years for all possible periods of TRAPPIST-1d. Instabilities
appeared in our simulations only for the unlikely scenarios of this planet on a 
significantly eccentric (e > 0.4) 4.5-day or 5.2-day orbit. 

To assess the potential of the TTV method to measure the masses of the
planets, we integrated the dynamical evolution of the system at high sampling over
two years, assuming Earth masses for the three planets and an 18.4-day circular
orbit for TRAPPIST-1d. These simulations resulted in TTV amplitudes of several
tens of seconds, and led us to conclude that, with an intensive transit monitoring
campaign using instruments able to reach timing precisions of a few tens of seconds
(for example, VLT/HAWK-I or UKIRT/WFCAM; Extended Data Table 4), it
should be possible to constrain the planetary masses.

Planets’ suitability for atmospheric characterization. We estimated the typical
signal amplitude in transit transmission spectroscopy for all the transiting
exoplanets with a size equal to or smaller than that of the mini-Neptune GJ 1214b (ref.
72). We computed this amplitude as 2Rp·heff/R★², where Rp is the planetary radius,
heff is the effective atmospheric height (that is, the extent of the atmospheric
annulus), and R★ is the stellar radius. The effective atmospheric height is directly
proportional to the atmospheric scale height, H = kT/(μg), where k is Boltzmann’s
constant, T is the atmospheric temperature, μ is the atmospheric mean molecular
mass, and g is the surface gravity. The ratio heff/H for a transparent atmosphere
is typically between 6 and 10, and thus depends strongly on the presence of clouds
and the spectral resolution and range covered. Our estimates (Fig. 2) are based on
an heff/H ratio of 7 and the conservative assumption of a volatile-dominated
atmosphere (μ = 20) with a Bond albedo of 0.3. All other parameters for the planets were
derived from exoplanets.org. As an illustration, the maximum transit depth
variations projected under those assumptions for GJ 1214b are about 250 p.p.m., in
agreement with independent simulations.
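
The amplitude estimate described above can be sketched as follows. The function name and the physical constants are assumptions of this example, and the planet parameters passed in are rough, rounded values chosen to resemble a GJ 1214b-like mini-Neptune; they are not the exoplanets.org values used in this work. With these inputs the result is a few hundred p.p.m., the same order as the value of about 250 p.p.m. quoted in the text.

```python
K_B = 1.380649e-23     # Boltzmann constant, J/K
AMU = 1.66053907e-27   # atomic mass unit, kg
G = 6.67430e-11        # gravitational constant, m^3 kg^-1 s^-2
R_EARTH = 6.371e6      # m
M_EARTH = 5.972e24     # kg
R_SUN = 6.957e8        # m


def transmission_amplitude(rp_rearth, mp_mearth, rstar_rsun, temp_k,
                           mu=20.0, heff_over_h=7.0):
    """Transit-depth variation 2 * Rp * h_eff / R*^2 (dimensionless)."""
    rp = rp_rearth * R_EARTH
    gravity = G * mp_mearth * M_EARTH / rp ** 2          # surface gravity, m/s^2
    scale_height = K_B * temp_k / (mu * AMU * gravity)   # H = kT / (mu g)
    h_eff = heff_over_h * scale_height
    return 2.0 * rp * h_eff / (rstar_rsun * R_SUN) ** 2


# Rough, assumed inputs for a GJ 1214b-like planet (illustrative only)
amp = transmission_amplitude(rp_rearth=2.7, mp_mearth=6.5, rstar_rsun=0.21, temp_k=550.0)
print(f"{amp * 1e6:.0f} p.p.m.")
```

Because the amplitude scales as 1/μ, doubling the mean molecular mass halves the signal, which is why the volatile-dominated case (μ = 20) is described as conservative relative to a hydrogen-rich atmosphere.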

For the same sample of planets, we also derived the typical SNRs in transit trans- 
mission spectroscopy from the ratio of our computed signal amplitudes over the 
square root of the flux (determined from the J-band magnitudes of the host stars). 
The SNRs of TRAPPIST-1’s planets in transmission are expected to range between 
0.22 and 0.55 times that of GJ 1214b under the same theoretical assumptions,
suggesting that these planets are well suited for atmospheric studies with
HST/WFC3 similar to those previously targeting GJ 1214b (refs 75, 76).

Given published simulations for terrestrial planets, we estimate that
characterization of TRAPPIST-1b, -1c and -1d should require up to 70 hours, 90 hours
and 270 hours, respectively, of in-transit observations with the James Webb Space
Telescope (JWST), and should yield atmospheric temperatures with relative
uncertainties below 15% and abundances within a factor of four. Assuming that
the atmospheres of TRAPPIST-1’s planets are not depleted and do not harbour a
high-altitude cloud deck, JWST should, notably, yield constraints on the
abundances of molecules with large absorption bands such as H2O, CO2, CH4, CO and
O3 if their abundances are at or greater than the 10-p.p.m. level.

We also assessed the potential of the cross-correlation technique to constrain
the atmospheric properties of the TRAPPIST-1 planets, following a published
formalism. We find that detecting O2 in TRAPPIST-1’s planets should require
up to 80 transit observations with one of the next-generation, giant ground-based
telescopes. Taking into account the limited fraction of transits visible at low air
mass, such an endeavour could be completed in 5 to 15 years.

Code availability. Equivalent widths and H2O-K2 index measurements in the
SpeX spectra were made using the IDL program created by A. Mann and
distributed at http://github.com/awmann/metal. Conversion of the UT times for the
photometric measurements to the BJD_TDB system was performed using the online
program created by J. Eastman and distributed at
http://astroutils.astronomy.ohio-state.edu/time/utc2bjd.html. The Image Reduction and Analysis Facility (IRAF)
software is distributed by the National Optical Astronomy Observatory, which
is operated by the Association of Universities for Research in Astronomy, Inc.,
under cooperative agreement with the National Science Foundation. The MCMC
software used to analyse the photometric data is a custom Fortran 90 code that
can be obtained upon request.

Extended Data Figure 1 | Raw TRAPPIST-1 transit light curves. The
light curves are shown in chronological order from top to bottom and left
to right, with unbinned data shown as cyan dots, and binned 0.005-day
(7.2-minute) intervals shown as black dots with error bars. The error bars
are the standard errors of the mean of the measurements in the bins.
The best-fit transit-plus-baseline models are overplotted (red line). The
light curves are phased for the mid-transit time and shifted along the
y axis for clarity. For the dual transit of 11 December 2015, the light
curve is phased for the mid-transit time of planet TRAPPIST-1c. T1b,
TRAPPIST-1b; T1c, TRAPPIST-1c; T1d, TRAPPIST-1d.

Extended Data Figure 2 | De-trended TRAPPIST-1 transit light curves. The details are as in Extended Data Fig. 1, except that the light curves shown
here are divided by the best-fit baseline model to highlight the transit signatures.

Extended Data Figure 3 | Near-infrared spectra of TRAPPIST-1.
a, Comparison of TRAPPIST-1’s near-infrared spectrum (black), obtained
with the spectrograph IRTF/SpeX, with that of the M8-type
standard LHS 132 (red). b, Cross-dispersed IRTF/SpeX spectrum of
TRAPPIST-1 in the 2.17-2.35-μm region. Na I, Ca I and CO features are
labelled. Additional structure primarily originates from overlapping H2O
bands. The spectrum is normalized at 2.2 μm. Fλ, spectral flux density;
fλ, normalized spectral flux density.

Extended Data Figure 4 | Flare events in the TRAPPIST 2015 photometry. The photometric measurements are shown unbinned (cyan dots) and
binned per 7.2-minute interval (black dots). For each interval, the error bars are the standard error of the mean.

Extended Data Figure 5 | Photometric variability of TRAPPIST-1.
a, Global light curve of the star as measured by TRAPPIST. The
photometric measurements are shown unbinned (cyan dots) and binned
per night (black dots with error bars (±s.e.m.)). This light curve is
compared with that of the comparison star 2MASS J23063445−0507511
shifted along the y axis for clarity. b, The same light curve for
TRAPPIST-1, folded on the period P = 1.40 days and binned by 10-minute
intervals (error bars indicate ±s.e.m.). For clarity, two consecutive periods
are shown.

Extended Data Table 1 | TRAPPIST-1 transit light curves


Date Instrument Np Texp Baseline Transit(s) 
function 


p(t?+xy?+f?) TRAPPIST-1c 


05 Nov 2015 
07 Nov 2015 
TRAPPIST p(C+xy?+f) 
VLT/HAWK-I 17x1.7s 

lz 
Itz 

1312 
J 175 | 5xls 
1109 [_Sxls 
55s 
55s 


For each light curve, the date, instrument, filter, number of points (Np), exposure time (Texp), and baseline function are given. For the baseline functions, p(t²), p(xy²) and p(f²) denote, respectively,
second-order polynomial functions of time, of the x and y positions, and of the full-width at half-maximum of the stellar images.

Extended Data Table 2 | Quadratic limb-darkening coefficients
Bandpass | u1 | u2
J (UKIRT/WFCAM) | 0.10 ± 0.05 | 0.57 ± 0.02
NB2090 (VLT/HAWK-I) | 0.04 ± 0.03 | 0.50 ± 0.03


We inferred these values and errors for the quadratic coefficients u1 and u2 for TRAPPIST-1 from theoretical tables (ref. 62), and used the values and errors as a priori knowledge of the stellar limb-darkening
in a global MCMC analysis of the transit light curves. The error bars were obtained by propagation of the errors on the stellar gravity, metallicity, and effective temperature.

Extended Data Table 3 | Posterior likelihoods of the orbital solutions for TRAPPIST-1d


period (d) orbit orbit fai oo 
| 0.98 


| 36.408 | 0.122 | 0.0014 || 0.093 | 0.06 
ee 


The likelihoods shown for the circular orbits are normalized to the most likely solution (that is, eet orbit of P= 18.204 days (d)). For each orbit, the semi-majo ical 
nits (AU)), assuming a stellar mass Seer a es e 1), and the mean irradiation, S, (in Earth units (Sear os own.

Extended Data Table 4 | Individual mid-transit timings measured for the TRAPPIST-1 planets


Mid-transit timing
(BJD_TDB − 2,450,000)

TRAPPIST-1b | TRAPPIST | 0 | = 7322.5161799913
TRAPPIST 73255391 oe
TRAPPIST | 6 | 7331.5803 ± 0.0013
TRAPPIST | 8 | 7334.6038 ± 0.0012
VLT/HAWK-I | 8 | 7334.60490 ± 0.00020
TRAPPIST 7337.6249 ± 0.0010
TRAPPIST 7340.64.7410.0010
HCT/HFOSC 7345.18011 ± 0.00089
UKIRT/WFCAM 7361.79960 ± 0.00030
UKIRT/WFCAM 7364.82137 ± 0.00056

TRAPPIST-1c | TRAPPIST | 0 | 7282.8058 ± 0.0010
TRAPPIST 7333.6633 ± 0.0010
UKIRT/WFCAM 7362.72623 ± 0.00040
TRAPPIST 7367.5699 ± 0.0012
TRAPPIST 7384.5230 ± 0.0011

TRAPPIST-1d | TRAPPIST | 0 | 7294.7736 ± 0.0014
TRAPPIST _ a 7367. 58180. 0015,

The transit timings shown were deduced from individual analyses of the transit light curves, assuming circular orbits for the planets. The error bars correspond to the 1σ limits of the posterior PDFs of
the transit timings.

Lightwave-driven quasiparticle collisions on a subcycle timescale


F. Langer, M. Hohenleutner, C. P. Schmid, C. Poellmann, P. Nagler, T. Korn, C. Schüller, M. S. Sherwin, U. Huttner,
J. T. Steiner, S. W. Koch, M. Kira & R. Huber


Ever since Ernest Rutherford scattered α-particles from gold foils,
collision experiments have revealed insights into atoms, nuclei
and elementary particles. In solids, many-body correlations lead
to characteristic resonances, called quasiparticles, such as
excitons, dropletons, polarons and Cooper pairs. The structure
and dynamics of quasiparticles are important because they define
macroscopic phenomena such as Mott insulating states, spontaneous
spin- and charge-order, and high-temperature superconductivity.
However, the extremely short lifetimes of these entities make
practical implementations of a suitable collider challenging. Here
we exploit lightwave-driven charge transport, the foundation of
attosecond science, to explore ultrafast quasiparticle collisions
directly in the time domain: a femtosecond optical pulse creates
excitonic electron-hole pairs in the layered dichalcogenide tungsten
diselenide while a strong terahertz field accelerates and collides
the electrons with the holes. The underlying dynamics of the wave
packets, including collision, pair annihilation, quantum interference
and dephasing, are detected as light emission in high-order spectral
sidebands of the optical excitation. A full quantum theory
explains our observations microscopically. This approach enables
collision experiments with various complex quasiparticles and
suggests a promising new way of generating sub-femtosecond pulses.

Following the principle of high-energy accelerators, their solid-state 
counterparts should prepare, accelerate and collide quasiparticles 
while detecting the outcomes. This can be realized only if all steps are 
accomplished faster than the ultrashort lifetime of the quasiparticle’. 
Preparing the quantum states of quasiparticles by femtosecond lasers is 
already well established?°. Ultrafast acceleration, in turn, may harness 
the latest breakthroughs of strong-field control: the carrier wave of an 
intense laser pulse has been applied to ionize atoms or molecules and 
recollide their fragments when the sign of the light-field changes'®. The 
excess energy of the collision is released as high-order harmonics of 
the laser frequency, which may emerge in attosecond bursts’, carrying 
key structural and dynamical information about the participants!!". 
Recently, this idea of subcycle control has been extended to solids, 
in which lightwaves have driven dynamical Bloch oscillations”®”!, 
interband quantum interference”* and high-order harmonic genera- 
tion?” 4 (HHG). However, the microscopic mechanisms are complex 
because the lightwave drives an inseparable combination of interband 
excitation”)**”° (quasiparticle preparation) and intraband accelera- 
tion?” *4”°. The recent observation of high-order sideband generation 
(HSG) in gallium arsenide quantum wells’*® has suggested a way to 
disentangle these steps. Following optical preparation of coherent excitons,
a strong terahertz (THz) field of frequency ν_THz modulates the
interband resonance such that spectral sidebands at even multiples of
ν_THz emerge. This observation has been modelled by THz-driven
electron-hole collisions, similarly to the three-step model of HHG in 
atomic gases”®. Yet, the potential of HSG for real-time quasiparticle 
collisions has remained untapped. 


Here, we introduce a subcycle quasiparticle collider. A femtosecond
optical pulse prepares coherent excitons in tungsten diselenide (WSe2)
at a well-defined phase of a THz driving field (E_THz). This phase defines
how E_THz accelerates and collides electrons and holes, thereby yielding
photon emission by pair annihilation, which is detected as high-order 
sidebands. We reveal the subcycle quantum dynamics of quasiparticle 
collisions, model the microscopic process by a full quantum theory and 
use the data to extract key information on the colliding species. Finally, 
we compare the process to HHG in the same material. 

As a member of the class of transition metal dichalcogenides, WSe2
features strongly bound Wannier excitons and unique spin-valley
coupling. Even in its bulk form, WSe2 has been reported to exhibit an
exciton binding energy of the order of 0.1 eV at its direct bandgap,
which implies that these quasiparticles are stable at room temperature.
In a first experiment, a 100-fs excitation pulse (red waveform in Fig. 1a)
centred at a photon energy of 1.621 eV (centre frequency, 392 THz) 
resonantly prepares coherent excitons at the direct band gap of WSe2 
(thickness, 60 nm) located at the K and K’ points in momentum space. 
A co-propagating intense THz pulse (blue waveform in Fig. 1a, b; centre
photon energy, hν_THz = 95 meV (h is the Planck constant); centre
frequency, ν_THz = 23 THz; peak electric field in air, E_peak = 17 MV cm^-1;
see Methods section ‘Experimental set-up’) modulates the excitonic
polarization and gives rise to HSG. The resulting intensity spectrum
I_HSG includes sidebands of positive and negative orders n, spaced at
2ν_THz (Fig. 1b). Spectral components up to n = 22 are clearly visible,
while the intensities of negative orders drop much faster with decreas- 
ing frequencies. The new frequency components feature the same 
polarization as the interband excitation pulse (Extended Data Fig. 1). 
In contrast to perturbative scaling (I_HSG ∝ E_peak^(2|n|)), the sideband intensity
grows linearly with E_peak for intermediate peak fields, depending on the
sideband order, and even saturates at the highest field strengths
(Fig. 1c, see also Extended Data Fig. 2 for the influence of the interband
excitation fluence). Finally, I_HSG peaks if the interband excitation is
resonant with the excitonic absorption peak (Fig. 1d), underpinning
its excitonic origin. Apart from the broadening of the sidebands due to
the ultrashort duration of E_THz, these findings agree well with previous
time-integrated studies.
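
The frequencies and photon energies quoted above can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that a sideband of order n lies at the excitation frequency plus n times the THz frequency, which is how the text describes the spectrum.

```python
# Planck constant in eV*s (exact value in the 2019 SI).
H_EV_S = 4.135667696e-15


def photon_energy_ev(freq_thz: float) -> float:
    """Photon energy (eV) of light with the given frequency (THz)."""
    return H_EV_S * freq_thz * 1e12


def sideband_frequency_thz(nu_exc_thz: float, nu_thz: float, order: int) -> float:
    """Frequency (THz) of the sideband of the given order (assumed nu_exc + n * nu_THz)."""
    return nu_exc_thz + order * nu_thz


def period_fs(freq_thz: float) -> float:
    """Oscillation period (fs) of a wave with the given frequency (THz)."""
    return 1e3 / freq_thz


if __name__ == "__main__":
    nu_exc, nu_thz = 392.0, 23.0  # values quoted in the text
    print("excitation photon energy (eV):", photon_energy_ev(nu_exc))
    print("THz photon energy (meV):", 1e3 * photon_energy_ev(nu_thz))
    print("driving period T (fs):", period_fs(nu_thz))
    for n in (-2, 2, 4, 22):
        print("order", n, "->", sideband_frequency_thz(nu_exc, nu_thz, n), "THz")
```

Under these assumptions the order n = 22 falls at 898 THz, and the even orders are separated by 2ν_THz = 46 THz.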

To directly resolve the underlying subcycle quantum dynamics, we
now prepare the coherent excitons within 10 fs, much faster than a
single oscillation period T = ν_THz^-1 of the THz wave. Figure 2a depicts
the spectral shape of the observed sideband intensity I_HSG as a function
of the delay between the excitation pulse and the peak of the THz
driving field, t_ex. Electro-optic detection allows us to determine the
complete waveform E_THz on the same absolute timescale (Fig. 2a, b,
blue curve). The spectrogram in Fig. 2a follows a nearly Gaussian
temporal envelope, resembling the THz envelope. Interestingly, I_HSG
is strongly modulated along the time axis with a period of T/2; that
is, there is a subcycle criterion for ‘good’ and ‘bad’ preparation
times t_ex.


1Department of Physics, University of Regensburg, 93040 Regensburg, Germany. 2Department of Physics and the Institute for Terahertz Science and Technology, University of California at Santa
Barbara, Santa Barbara, California 93106, USA. 3Department of Physics, University of Marburg, 35032 Marburg, Germany.


12 MAY 2016 | VOL 533 | NATURE | 225 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Figure 1 image. Panel a: schematic. Panel b: sideband spectrum versus frequency (THz), with an inset showing the THz field versus delay time (fs). Panel c: sideband intensity versus peak field strength, E_peak (MV cm^-1), for second-, fourth-, sixth- and eighth-order sidebands. Panel d: fourth- and sixth-order sidebands versus wavelength (nm) and frequency (THz).]


Figure 1 | High-order sideband generation in tungsten diselenide.
a, Schematic of the experiment in reciprocal space (E, energy; k, wave
vector): an interband excitation pulse (red waveform) creates an excitonic
polarization in WSe2 (red, vertical arrow), while a strong multi-THz field
(blue waveform) simultaneously accelerates the wave packets of the
electron and hole (curved arrows) within their respective bands (grey
shaded parabolas). b, Measured intensity spectrum (red) of high-order
sidebands from WSe2 (thickness, 60 nm; sample at room temperature)
driven by a phase-locked THz transient featuring a centre frequency of
23 THz and an external peak field strength of E_THz = 17 MV cm^-1 (see inset;
Keldysh parameter γ = 0.08 < 1) under resonant optical excitation at a
frequency of 392 THz (labelled by ‘0’; 0th order has been multiplied by a
factor of 10^-4; other numerals denote the order of sidebands). The black
dashed curve shows the calculated intensity spectrum I_HSG. a.u., arbitrary
units. c, Recorded high-order sideband intensity I_HSG of orders (n) two to
eight as a function of driving peak field strength E_peak. Dotted lines follow
a perturbative scaling law, I_HSG ∝ E_peak^(2|n|); dashed lines mark a linear scaling;
the grey shaded area indicates the noise level. d, Measured generation
efficiency of the fourth- and sixth-order sideband for different excitation
wavelengths (red spheres). Error bars represent the bandwidth (full-width
at half maximum) of the excitation spectra (excitation spectrum at
392 THz shown as a dashed curve). The shaded area shows the measured
exciton resonance in the absorption spectrum of the sample.

For a quantitative timing analysis, Fig. 2b compares E_THz (blue curve)
with the spectrally integrated intensity I_HSG (red shaded curve). The
strongest THz half-cycle occurs at a delay δ_global ≈ T with respect to the
most-intense sideband peak. This retardation rules out instantaneous
optical nonlinearities as the microscopic origin of HSG. A detailed
comparison of E_THz and I_HSG also reveals a subcycle delay δ_sc of the
observed sideband bursts relative to the THz crests (see inset in Fig. 2b).
For subsequent half-cycles of the driving field, δ_sc initially increases,
reaches a maximum value of approximately T/8 at the centre of the
THz pulse and slightly decreases afterwards (Fig. 2d).

To link these signatures of I_HSG with the microscopic quasiparticle
motion, we analyse the experiment using a full quantum mechanical
model based on the semiconductor Bloch equations (see Methods
section ‘Microscopic model of HSG’). For a 100-fs preparation pulse,
the experimentally observed HSG spectrum is well reproduced by our
calculations (Fig. 1b, black dashed curve), showing a similar modulation
and intensity roll-off on the high- and low-frequency sides. Using
a 10-fs preparation pulse, the calculations quantitatively reproduce the
temporal characteristics of I_HSG (Fig. 2c), including even delicate features
such as δ_sc (Fig. 2d).

Figure 3 visualizes the underlying microscopic quasiparticle dynamics
for two characteristic preparation times t_ex, corresponding to minimum
(t_ex = −4 fs, Fig. 3c, e, g) and maximum (t_ex = 7 fs, Fig. 3d, f, h)
HSG. The two quasiparticle-collider sequences are schematically
illustrated by Feynman diagrams in Fig. 3a, b. For t_ex = −4 fs, the
coherent excitons are prepared shortly before a THz field crest (see
vertical dashed lines in Figs 2c and 3c). Immediately after optical
preparation, the average momenta of the electron (Fig. 3e, solid curve)
and the hole (dashed curve) are located in the vicinity of the K and K’
points of the Brillouin zone (defined as k = 0). The strong THz field
accelerates the distribution function of the electron (f_k^e; Fig. 3e,
false-colour plot) and the hole (f_k^h, not shown) away from k = 0, driving
the electron and hole wave packets apart. A qualitatively different
situation occurs for t_ex = 7 fs (Fig. 3d, f, h), for which the coherent
excitons are prepared shortly before a zero crossing of the THz field
(vertical dashed lines in Figs 2c and 3d). Now the mean momenta
change sign; that is, the initially separating electron and hole are set
on a recollision path (Fig. 3f).
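
The contrast between the two preparation times can be illustrated with a purely classical toy picture. The sketch below is not the quantum theory used in this work. It assumes free carriers with a parabolic dispersion, a monochromatic driving field proportional to cos(2πν_THz t) with a crest at t = 0, zero relative velocity at the moment of preparation, and no Coulomb attraction or dephasing. The function name and its parameters are hypothetical. In units of eE_0/(μω²), with μ the reduced mass, the electron-hole separation then follows cos φ − cos φ_0 + (φ − φ_0) sin φ_0, where φ = ωt and φ_0 = ωt_ex.

```python
import math


def recollision_time_fs(t_ex_fs, nu_thz=23.0, n_cycles=3.0, steps=20000):
    """First time (fs, on the same axis as t_ex, field crest at t = 0) at which
    the classical electron-hole separation returns to zero, or None if no
    return is found within n_cycles driving periods after preparation."""
    omega = 2.0 * math.pi * nu_thz * 1e-3  # angular frequency in rad/fs
    phi0 = omega * t_ex_fs                 # driving phase at preparation

    def separation(phi):
        # Relative coordinate in units of e*E0/(mu*omega^2); only zero crossings matter.
        return math.cos(phi) - math.cos(phi0) + (phi - phi0) * math.sin(phi0)

    dphi = 2.0 * math.pi * n_cycles / steps
    prev = separation(phi0 + dphi)  # skip the trivial zero at phi0 itself
    for i in range(2, steps + 1):
        phi = phi0 + i * dphi
        cur = separation(phi)
        if prev * cur < 0.0:
            # Linear interpolation between the two grid points around the crossing.
            phi_cross = phi - dphi * cur / (cur - prev)
            return phi_cross / omega
        prev = cur
    return None


if __name__ == "__main__":
    for t_ex in (-4.0, 7.0):
        print("t_ex =", t_ex, "fs -> recollision at", recollision_time_fs(t_ex), "fs")
```

Within this toy picture, preparation shortly before a field crest (t_ex = −4 fs) gives no return of the separation to zero, whereas preparation shortly before a zero crossing (t_ex = 7 fs) does, which matches the qualitative distinction drawn above. The numbers it returns should not be read as predictions for the experiment.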

Particle preparation in typical accelerators is spatially precise and
collisions are monitored in real space. Our concept follows the relative
motion of quasiparticles, and the preparation is precise in terms of the
relative coordinates of the quasiparticles because the optical pulse creates
them in a single, spatially correlated excitonic state described by a
pair correlation function (see Methods section ‘Coherent electron-hole
pair-correlation function’). Figure 3g, h displays its coherent part
g_coh(r), which defines how electrons are coherently coupled to holes as
a function of their relative distance r (reciprocal space representation
in Extended Data Fig. 3). In both panels, g_coh(r) is maximum close to
the peak of the optical pulse before it decays by ultrafast dephasing
(dephasing time T2 < T/2). Figure 3c, d shows the corresponding mean
electron-hole separation ⟨r⟩. Suitable t_ex (Fig. 3d) allows the THz field
(Fig. 3c, d, blue curve) to separate and collide the electron-hole pairs
at high energy. Owing to this ballistic motion, maximum HSG occurs
for preparation times that are offset with respect to the field crests,
causing the distinct delay δ_sc observed in Fig. 2d. The precise temporal
structure seen in the experiment can be reproduced only if
excitation-induced dephasing is taken into account (see Methods
section ‘Effect of excitation-induced dephasing on HSG’ and Extended
Data Figs 4-6). By contrast, for ‘bad’ timing t_ex, the electron and the
hole are monotonically driven apart (Fig. 3c), suppressing collisions
(see Fig. 3a). The situation is analogous to a cyclotron, in which electrons
are effectively accelerated only if they are injected at the right
phase of the alternating accelerator field.

Whereas the average intraband dynamics ⟨r⟩ resembles semiclassical
trajectories (see Fig. 3c, d), the connection between quasiparticle
collision and HSG requires a full quantum theory. Similarly to electron-
positron pairs in vacuum, colliding electron-hole pairs in a solid are
subject to pair annihilation, during which a high-energy photon may
be emitted, leading to sideband signals; this microscopic annihilation
dynamics manifests itself in g_coh(r) (Fig. 3h). After substantial spreading
of g_coh(r), electron-hole collision is prompted by an abrupt termination
of the correlation function due to pair annihilation. Subsequently,
interference patterns occur among the residual fragments that survive the
collision (see black ellipse in Fig. 3h). These distinct collision-induced
annihilation features are absent in the case of ‘bad’ injection times, for
which the coherence gradually abates owing to dephasing (Fig. 3g). The
details of g_coh(r) are a direct manifestation of the many-body
interactions governing the internal structure of the exciton, and are evident
in the temporal and spectral response in HSG.

Detection of HSG, combined with precise preparation and acceleration,
delivers an in-depth view of quasiparticles. As in particle colliders,
one can expose the scattering products by systematic variation of key
parameters, such as the strength, frequency and duration of the preparation
and THz pulses, and their relative delay t_ex. In Methods section
‘Data landscape of quasiparticle colliders’ we provide first examples of
the ways in which to retrieve material-specific information such as the
exciton binding energy, scattering times and Coulomb enhancement
of the collisional cross-section via dedicated analyses of the multi-
dimensional data landscape of I_HSG. In particular, the collision events
recorded in our experiments bear the hallmarks of an exciton binding
energy around 60 meV.

[Figure 2 image. Panel a: spectrogram of I_HSG versus delay time t_ex. Panels b (experiment) and c (theory): normalized E_THz and normalized I_HSG versus delay time, t_ex (fs), with δ_global marked. Panel d: subcycle delay versus delay time, t_ex (fs), for experiment and theory.]

Figure 2 | Subcycle electron-hole recollisions. a, Spectrally resolved
high-order sideband intensity I_HSG (colour scale) as a function of delay
time t_ex between the THz driving field (E_THz, blue curve, see also b) and
a 10-fs interband excitation pulse. b, c, Measured (b) and calculated (c)
high-order sideband intensity I_HSG (red, spectrally integrated between
435 THz and 650 THz) on the same timescale as the driving waveform
(E_THz, blue), which peaks with a global delay δ_global after I_HSG. On a subcycle
scale, the recorded sideband intensity (red) peaks at a distinct time delay
δ_sc after the nearest extrema of the driving waveform (see close-up in b).
d, Subcycle delay of I_HSG in units of the driving period T for subsequent
driving half-cycles at their respective delay times as measured (bright
red spheres; error bars, standard deviation of δ_sc for 25 consecutive
measurements) and calculated (dark red spheres). The horizontal black
dashed line marks δ_sc T^-1 = 0.

Well-defined initial momenta and relative spatial coordinates of the 
electrons and holes are prerequisites for controlled collisions. By contrast,
the direction of acceleration in the WSe2 crystal is less relevant,
as seen in Fig. 4: the high-order sideband spectra show virtually no
dependence on the azimuthal in-plane angle φ of the THz polarization.
This behaviour is compatible with our theoretical expectation
that coherent electron-hole wave packets in HSG remain rather close 
to k=0 (Fig. 3f). Both Coulomb attraction between electrons and 
holes and fast dephasing keep the coherent electron-hole excursions 
within a region in which the effective mass is approximately isotropic 
(see Fig. 1a). Additionally, HSG probes the interband polarization of 
colliding electrons and holes only at spatial overlap”’ (inset of Fig. 4a). 

This unique characteristic of the quasiparticle collider becomes par- 
ticularly clear when we compare the dynamics with HHG in the same 
sample, which is achieved by applying the strong THz field without 
optical preparation of excitons. A typical high-harmonic spectrum 
exhibiting odd orders up to n’ = 47 is depicted in Extended Data Fig. 7. 
The intensity of HHG is smaller by approximately one order of mag- 
nitude than HSG for frequencies below 650 THz. Owing to the non- 
perturbative, non-resonant excitation of Bloch electrons with 
simultaneous acceleration by the strong-field bias?”?!4”°, the initial 
carrier distribution is smeared out throughout the entire Brillouin zone. 
Furthermore, coherent interband polarization can emit photons at any 
point in momentum space (see inset of Fig. 4b). This situation manifests
itself by a critical dependence of the intensity and the shape of
high-order harmonics on φ (Fig. 4b), reflecting the six-fold symmetry
of WSe2 in the layer plane.

Figure 3 | Quantum simulation of subcycle electron-hole collisions
underlying HSG. a, b, Schematic Feynman diagrams depicting electron-hole
(e-h) pair creation by a near-infrared photon and acceleration
by E_THz: only for ‘good’ excitation times (b), the acceleration leads to a
collision and annihilation of the electron-hole pair, thereby emitting a
sideband photon hν_HSG. c, d, Trajectories (red, the intensity of the line
represents the density of coherent excitons) tracing the real-time evolution
(time t) of the mean electron-hole separation (weighted average of g_coh(r))
for characteristic delays t_ex corresponding to minimum (c) and maximum
(d) HSG emission (compare with Fig. 2c). Vertical black dotted lines
highlight t = 0, which marks the peak of the excitation pulse. Although
electrons and holes are initially separated, they rapidly recollide (zero
excursion marked by the black horizontal lines; time of recollision is
highlighted by the black arrow in d) upon reversal of the driving field (blue
curves), inducing a strong HSG signal (d). For ‘bad’ excitation times, the
electron-hole separation increases monotonically, prohibiting recollisions
(c). e, f, Occupation f_k^e (colour scale) of the first conduction band as a
function of time t and crystal momentum k (in units of the wave
vector k_BZ limiting the first Brillouin zone) for delays of t_ex = −4 fs and 7 fs,
respectively. Horizontal white lines mark k = 0. Black solid (dashed) curves
trace the weighted average excursion of electrons (holes) in reciprocal
space. g, h, Coherent electron-hole correlation function g_coh(r) (colour
scale) as a function of time t and real-space coordinate r. Interference
patterns occur after the abrupt collapse of coherence caused by
electron-hole recollision (highlighted by the black ellipse in h),
and are absent for t_ex = −4 fs (g). Horizontal white lines mark r = 0.

Figure 4 | Experimental comparison of high-order sideband and high-
order harmonic generation. a, b, Spectrally resolved high-order sideband
intensity I_HSG (a; narrow-band exciton preparation at a frequency of
392 THz) and high-order harmonic intensity I_HH (b) as a function of the
azimuthal orientation (angle φ) of the WSe2 sample (normal incidence).
Both intensity (colour) scales are normalized to the maximum HSG signal.
Whereas high-order sidebands show virtually no dependence on φ, the
high-harmonic spectra reflect the six-fold symmetry of WSe2. a, Inset,
real-space visualization of THz-driven electron-hole recollisions: upon
excitation of a bound electron-hole pair (grey), the carriers (red and blue
wave packets) are accelerated and recollided by the strong light field (blue)
to recombine at r = 0, giving rise to high-order sideband emission. b, Inset,
schematic of high-order harmonic generation: a polarization between
valence and conduction band (grey) is induced by a strong multi-THz
field (blue) and simultaneously accelerated within the bands (red and blue
spheres, curved red and blue arrow). During this process, the coherent
interband polarization is continuously modified (red vertical arrows).

In conclusion, we demonstrate the realization of quasiparticle collisions
in the time domain. We excite, accelerate and collide coherent
electron-hole pairs in WSe2, resulting in ultrabroad sideband emission.
On a subcycle scale, we directly trace the retarded, ballistic wave-packet
dynamics and retrieve key information such as the exciton binding
energy, scattering times and the Coulomb enhancement of the collisional
cross-section. As in conventional particle accelerators, our
optical excitation pulse prepares electron and hole wave packets with
uncertainty-limited momenta and relative spatial coordinates. Yet
their absolute position is defined only to within the relatively large
focal spot size, which may be overcome in future near-field experiments
by preparing excitons in a nanometre-sized interaction region.
Both in the quasiparticle collider and in its large-scale counterpart,
intrinsic characteristics of the collision become accessible by detecting
the concomitantly emitted high-frequency radiation. The high
density of quasiparticles in solids helps to generate relatively intense,
all-coherent, ultrabroadband radiation, which may pave the way
to new sub-femtosecond light sources; furthermore, the subcycle
modulation of HSG may facilitate information processing at
optical-clock rates. Large-scale colliders have enabled the detailed study
of elementary particles. Likewise, we anticipate that THz-driven,
real-time collisions (operating on much smaller, widely tunable
length and energy scales) will be key to studying complex
quasiparticles in numerous applications of modern materials science;
examples include excitons, trions, biexcitons or dropletons in quasi-
two-dimensional atomic monolayer systems, Dirac-like quasiparticles
in graphene and topological insulators, or polarons in materials
featuring strong electron-phonon coupling. This scheme may also
be extended to optically excited quasiparticles in unconventional
superconductors, and could help to resolve some of the outstanding
questions in condensed matter research.


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS


Experimental set-up. Intense, phase-locked waveforms in the far- to mid- 
infrared spectral region (multi-terahertz range) are generated by difference fre- 
quency mixing of two spectrally detuned, near-infrared pulse trains from a dual 
optical parametric amplifier pumped by a titanium-sapphire laser amplifier.
The few-cycle transients feature peak fields of up to approximately 1 V Å^-1 (ref. 31).
A super-continuum source based on filamentation in an yttrium aluminium gar- 
net crystal delivers ultrabroadband white-light pulses covering the whole near- 
infrared and visible range. Both pulse trains are collinearly superimposed with an 
indium-tin-oxide-coated beam splitter and focused onto the tungsten diselenide 
sample under normal incidence using a gold-coated parabolic mirror. A mechan- 
ical delay stage in the excitation beam path allows us to temporally delay the pulses 
with respect to each other. 

The sample under study is a 60-nm-thick sheet of WSe2 that has been cleaved
using a viscoelastic gel film and placed on a diamond substrate grown by chemical
vapour deposition. For resonant, narrow-band excitation of the excitonic polarization
in WSe2, optical band-pass filters featuring a transmittance bandwidth of
10 nm are used (see Fig. 1d). For the time-resolved measurements, the bandwidth
of the excitation pulse is adjusted in the Fourier plane of a prism compressor, which 
enables the required pulse duration of 10 fs. Spectral components of the excitation 
pulse above 400 THz are filtered out to avoid the resonant generation of a high 
density of unbound electron-hole pairs. All experiments were performed at room 
temperature and in ambient air. No sample degradation was observed during the 
whole course of experiments. 

The spot sizes of the excitation pulse and the THz pulse at the sample are 22 µm
and 85 µm, respectively (intensity full-width at half-maximum). The generated
high-order sideband intensity I_HSG and the high-order harmonic intensity I_HH are
recorded with a spectrograph featuring a cooled silicon CCD camera. All spectra 
are corrected for the grating efficiency and the quantum efficiency of the detector. 

The pulse energy of the narrowband excitation amounts to approximately 0.1 nJ
in our current experiments. A comparison of the spectral components of the excitation
and the sideband signals yields an energy conversion efficiency of approximately
0.1% in the 60-nm-thick WSe2 sample for the applied excitation fluence
(approximately 25 µJ cm^-2). Therefore, we estimate the energy in the high-order
sideband pulse train as roughly 0.1 pJ.
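
The estimate above is simple arithmetic, sketched below with hypothetical helper names. The fluence cross-check additionally assumes, as a rough simplification that is not stated in the text, that the pulse energy is spread uniformly over a disc whose diameter equals the quoted 22 µm spot size.

```python
import math


def sideband_energy_pj(excitation_energy_nj: float, conversion_efficiency: float) -> float:
    """Energy (pJ) in the sideband pulse train for a given excitation pulse
    energy (nJ) and energy conversion efficiency (dimensionless fraction)."""
    return excitation_energy_nj * 1e3 * conversion_efficiency


def flat_top_fluence_uj_cm2(energy_nj: float, diameter_um: float) -> float:
    """Fluence (uJ/cm^2), ASSUMING a uniform disc of the given diameter."""
    radius_cm = 0.5 * diameter_um * 1e-4
    area_cm2 = math.pi * radius_cm ** 2
    return energy_nj * 1e-3 / area_cm2


if __name__ == "__main__":
    # 0.1 nJ excitation at 0.1% efficiency, as quoted in the text.
    print("sideband energy (pJ):", sideband_energy_pj(0.1, 1e-3))
    print("rough fluence (uJ/cm^2):", flat_top_fluence_uj_cm2(0.1, 22.0))
```

With the quoted numbers the first function gives about 0.1 pJ, and the crude disc model lands in the same range as the stated fluence of approximately 25 µJ cm^-2.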

Replacing the WSe2 sample by a 6.5-µm-thick zinc telluride (ZnTe) detector
crystal allows for electro-optic detection of the driving field. To this end, we carefully
place the crystal into the THz focus at the same position and account for
the complex detector response. The delay time t_ex between the THz waveform
and the near-infrared excitation pulse enables a direct temporal correlation of the
driving waveform with the recorded trace of high-order sideband emission. To
describe the microscopic dynamics following exciton creation, a second timescale
t is introduced, which describes the real time evolving for a fixed value of t_ex, as
shown, for instance, in Fig. 3.

Microscopic model of HSG. A complete quantum description of optical and
THz excitations of solids follows from the semiconductor Bloch equations
(SBEs). Compared to earlier HSG models, the SBEs fully include many-
body effects beyond isolated electron-hole pairs, the interplay of HSG versus HHG,
non-parabolic dispersions, and dephasing and relaxation processes. The SBEs are

\[
\begin{aligned}
i\hbar \frac{\partial}{\partial t} p_k &= \tilde{\varepsilon}_k\, p_k - \hbar\Omega_k(t)\,(1 - f_k^e - f_k^h) + i|e|\,E_{\mathrm{THz}}(t)\cdot\nabla_k p_k + \Gamma_k \\
\hbar \frac{\partial}{\partial t} f_k^e &= -2\,\mathrm{Im}\!\left[\hbar\Omega_k(t)\,p_k^*\right] + |e|\,E_{\mathrm{THz}}(t)\cdot\nabla_k f_k^e + \Gamma_k^e \qquad (1) \\
\hbar \frac{\partial}{\partial t} f_k^h &= -2\,\mathrm{Im}\!\left[\hbar\Omega_{-k}(t)\,p_{-k}^*\right] + |e|\,E_{\mathrm{THz}}(t)\cdot\nabla_k f_k^h + \Gamma_k^h
\end{aligned}
\]

in which p_k^* is the complex conjugate of the microscopic interband polarization p_k
and f_k^e (f_k^h) defines the electron (hole) occupation in a conduction-band (valence-
band) state with a crystal momentum ħk (ħ is the reduced Planck constant). The
equations also contain the strength of the field coupling via Ω_k and the renormalized
single-particle energies ε̃_k; the elementary charge is denoted by e, and scattering
contributions are considered by Γ_k, Γ_k^e and Γ_k^h. Physically, p_k defines the
transition amplitude of exciting an electron from one band to another while ħk is
conserved. As an optical field E_opt(t) induces such a transition, it eventually generates
occupations f_k^e and f_k^h. The presented SBEs can be generalized in a straightforward
manner for multiple bands.

As an inverse process, electrons recombine with holes via photon emission. The
coherent part of this process yields radiative decay of the polarization and the
corresponding emission spectrum is I(ω) ∝ |ωP(ω) + iJ(ω)|^2. The
Fourier transforms of the macroscopic polarization P(t) = Σ_k d_cv(k) p_k + c.c.
(in which c.c. represents the complex conjugate) and the macroscopic current
J(t) = Σ_{λ,k} j_λ(k) f_k^λ contain the dipole and current matrix elements d_cv(k) and
j_λ(k), respectively. The currents in a given band λ = e, h are defined by its
energy dispersion ε_k^λ.

Because electrons and holes are charged particles, they experience a strong
Coulomb force, which creates non-trivial many-body effects among p_k, f_k^e and f_k^h
during their finite lifetimes. For example, the strength of the field excitation
becomes renormalized

\[
\hbar\Omega_k(t) = d_{cv}(k)\cdot E(t) + \sum_{k'} V_{k-k'}\, p_{k'}
\]

by the Coulomb matrix element V_k, while its unrenormalized part is a product of
d_cv(k) and the total field E = E_opt + E_THz, containing also the THz field E_THz. The
Coulomb sum within Ω_k(t) results in excitonic resonances in the polarization.
The single-particle transition energy ε_k becomes renormalized too

\[
\tilde{\varepsilon}_k = \varepsilon_k - \sum_{k'} V_{k-k'}\,\left(f_{k'}^e + f_{k'}^h\right)
\]

Additionally, phonon, photon and Coulomb interactions yield scattering contributions
Γ_k, Γ_k^e and Γ_k^h due to high-order correlations. We include microscopic
scattering with the approximation Γ_k = −iħ p_k / T2(k), which includes excitation-
induced dephasing (EID) via the k-dependent scattering time T2(k), as explained
in Methods section ‘Effect of excitation-induced dephasing on HSG’.

Microscopically, Γ_k^e and Γ_k^h induce a relaxation of electron and hole distributions,
which we implement using a phenomenological relaxation towards a symmetric 
carrier distribution via 7 = — —( : +f ip k )on a timescale defined by 7 (ref. 24). 


In our computations, τ is chosen to be 6 fs.

Our experiments show that high-order sideband generation (HSG) excites carriers
only in the vicinity of the K point where WSe2 electrons and holes can be
described with an effective mass approximation. To determine the general aspects
of the interplay between HSG and high-harmonic generation (HHG), we use a one-
dimensional (1D) two-band model, and adjust V_k to produce a binding energy
E_B = 60 meV (corresponding to 14.5 THz) for the 1s-exciton state. We match electrons
and holes to produce the same effective mass m_e = m_h = 0.52 m_0 (ref. 39), with
the free electron mass m_0, close to the K point. We solve the closed system of coupled
differential equations (1) numerically using a Runge-Kutta algorithm.
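
As a much-reduced illustration of this kind of time stepping, the sketch below integrates only the dephasing part of the polarization equation for a single k state, dp/dt = (−iΔ − 1/T2) p, with the classical fourth-order Runge-Kutta scheme. It is not the solver used for equations (1): the specific Runge-Kutta variant, the step size, the detuning Δ and all function names are assumptions made for the example, and the field coupling, Coulomb sums and k-space gradients are omitted. The value T2 = 12 fs is the low-k dephasing time quoted in Methods.

```python
import math


def rk4_step(rhs, t, y, dt):
    """One classical fourth-order Runge-Kutta step for dy/dt = rhs(t, y)."""
    k1 = rhs(t, y)
    k2 = rhs(t + 0.5 * dt, y + 0.5 * dt * k1)
    k3 = rhs(t + 0.5 * dt, y + 0.5 * dt * k2)
    k4 = rhs(t + dt, y + dt * k3)
    return y + dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0


def decay_polarization(t2_fs=12.0, detuning_rad_per_fs=0.05, t_end_fs=24.0, dt_fs=0.1):
    """Propagate a toy polarization p(0) = 1 under detuning and dephasing only."""

    def rhs(_t, p):
        # Free rotation at the (hypothetical) detuning plus decay with time T2.
        return (-1j * detuning_rad_per_fs - 1.0 / t2_fs) * p

    p = 1.0 + 0.0j
    t = 0.0
    for _ in range(int(round(t_end_fs / dt_fs))):
        p = rk4_step(rhs, t, p, dt_fs)
        t += dt_fs
    return p


if __name__ == "__main__":
    p_end = decay_polarization()
    # |p|^2 should follow exp(-2 t / T2) for this toy equation.
    print(abs(p_end) ** 2, math.exp(-2.0 * 24.0 / 12.0))
```

For this toy equation the numerical |p|^2 can be compared directly with the analytic decay exp(−2t/T2), which is a convenient sanity check before adding further terms.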

We have confirmed that a two-dimensional (2D) computation produces essentially
the same HSG results. Note that bulk WSe2 features a layered structure where
electron-hole motion is free only within each 2D layer. However, only a 1D
computation is numerically feasible to fully include HHG, Bloch oscillations and a
non-parabolicity in the dispersion ε_k. The 1D computations also confirm that HHG
is an order of magnitude weaker than HSG, as in the experiment. 

Coherent electron-hole pair-correlation function. The excitations underlying
HSG can be investigated in real space by following the electron-hole pair-
correlation function

\[
g_{eh}(r) = \langle \hat{\rho}_e(r)\,\hat{\rho}_h(0) \rangle
\]

containing the operators for electron and hole densities ρ̂_e and ρ̂_h, respectively. As
discussed in ref. 33, the envelope of ρ̂_e(r) and ρ̂_h(0) determines the ways in which
electrons move with respect to holes on length scales larger than the unit cell. The
resulting g_eh(r) defines the conditional probability of finding an electron at position
r when a hole is at the origin.

To identify the coherent electron-hole acceleration, it is convenient to separate
incoherent and coherent contributions from g_eh(r). Following the derivation in
ref. 33, we find g_eh(r) = g_inc(r) + g_coh(r), with coherent part g_coh(r) = |p(r)|^2 and
incoherent part g_inc(r) = ρ_e ρ_h + Δg_eh(r), which is proportional to the product
of electron and hole densities (ρ_e and ρ_h), to the lowest order. The formation of
pairwise, incoherent correlations generates an additional contribution Δg_eh(r).
However, g_coh(r) is independent of correlation formation and is exclusively defined
by the coherent polarization p(r), which is given by the Fourier transform of
p_k and describes the relative motion of electron-hole coherences. In other words,
g_coh(r) defines the way in which coherently generated electrons are distributed
with respect to the holes in real space. Because the SBEs fully include the many-
body aspects of the polarization dynamics p_k, g_coh(r) generalizes the descriptions
that follow a single electron-hole pair during HSG.

Effect of excitation-induced dephasing on HSG. To quantify collision dynamics 
under different conditions, Extended Data Fig. 6b compares the mean wave vector 
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of the electron-hole wave packet for a 10-fs-long (red curve) and a 100-fs-long 
(black curve) optical excitation pulse centred at t.. = 7 fs, while the dephasing is 
kept constant (T>= 12 fs). The THz waveform (Extended Data Fig. 6a) is identical 
for both optical excitations. The respective envelope of the excitation pulse is shown 
as the grey (yellow) shaded area for the 100-fs (10-fs) pulse. 

We observe that the electron-hole wave packets are accelerated to much higher
momentum states in the case of the long (100-fs) excitation pulse compared to the
short (10-fs) excitation pulse. As discussed in the main text, a zero crossing of $\langle k \rangle$
enables collisions and annihilation of optically generated coherent excitons. For
the short excitation pulse, the magnitude of $|p_k|^2$ substantially drops after the
recollision. Therefore, the motion of residual coherent electron-hole pairs is indicated
by the red dashed curve in Extended Data Fig. 6b, c, whereas the dominant $|p_k|^2$
motion is plotted as a red solid curve. For the long excitation pulse, the excitonic
polarization $|p_k|^2$ is generated continuously, such that it is not diminished
substantially by the collisions. The corresponding mean wave vector $\langle k \rangle$ is plotted as
a black curve in Extended Data Fig. 6b, c.

Applying a constant dephasing time $T_2$ implies that all excitonic components
of the polarization decay with an identical rate. However, the Coulomb interaction
scatters polarization with the excited electron-hole distributions, yielding a strong
exciton-state-dependent EID. More specifically, the 1s-exciton state (which is
tightly bound around the K point) features lower scattering rates than other
exciton states that spread out in $k$, which is included by introducing a $k$-dependent
dephasing time $T_2(k)$ as shown in the inset of Extended Data Fig. 4a: for low $k$,
$T_2(k)$ is 12 fs, whereas it decreases to 2 fs at elevated $k$. Comparably fast dephasing
times have also been observed in previous studies. Additionally, the large
temporal modulations in the HSG intensity as shown in Fig. 2a imply dephasing
times $T_2$ that are substantially shorter than one half-cycle of the THz driving field
($T_2 < T/2 = 22$ fs).
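
To make the numbers in this paragraph concrete, the following sketch models a momentum-dependent dephasing time that falls from 12 fs at low $k$ to 2 fs at elevated $k$, and evaluates the half-cycle of a 22.3-THz driving field (the fundamental frequency quoted in the Extended Data Fig. 7 caption). The smooth tanh step, the switching wave vector `k_switch` and the transition width `width` are hypothetical choices; the text specifies only the two limiting values of $T_2(k)$.

```python
import numpy as np

def t2_of_k(k, k_switch=0.1, width=0.02, t2_low_k=12.0, t2_high_k=2.0):
    """Hypothetical smooth-step model of T2(k) in fs.

    Only the limits (12 fs at low k, 2 fs at elevated k) come from the
    text; k_switch and width (in units of k_BZ) are illustrative guesses.
    """
    step = 0.5 * (1.0 + np.tanh((np.abs(k) - k_switch) / width))
    return t2_low_k + (t2_high_k - t2_low_k) * step

def half_cycle_fs(freq_thz):
    """Half of the oscillation period, in fs, for a frequency given in THz."""
    return 0.5 * 1.0e3 / freq_thz

print(t2_of_k(np.array([0.0, 0.5])))  # limiting values near 12 fs and 2 fs
print(half_cycle_fs(22.3))            # about 22.4 fs
```

Both limiting dephasing times lie below the half-cycle duration, consistent with the condition $T_2 < T/2$ stated above.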

To determine the effect of EID on electron-hole collisions, Extended Data Fig. 6c
shows $\langle k \rangle$ for the momentum-dependent dephasing $T_2(k)$. By comparing the
black curves (100-fs-long excitation) in Extended Data Fig. 6b and c, we conclude
that EID reduces the maximum excursion by roughly a factor of 3 compared to
the constant dephasing result. The mean momentum trajectory $\langle k \rangle$ for the short
(10-fs) excitation pulse is less affected by EID, owing to the reduced maximum
displacement.

Extended Data Fig. 4 compares the measured, time-integrated HSG spectra
(shaded area) with computations including EID (black curves), a constant
dephasing $T_2 = 3.2$ fs (Extended Data Fig. 4a, red curve), and $T_2 = 4$ fs (Extended Data
Fig. 4b, blue curve), respectively. We observe that only calculations taking EID into
account explain the height of all observed sidebands. More specifically, the
simulations using a constant dephasing achieve good agreement only for either low-order
sidebands ($n < 8$, $T_2 = 4$ fs) or sidebands above 600 THz ($T_2 = 3.2$ fs). In numbers,
$T_2 = 4$ fs ($T_2 = 3.2$ fs) overestimates (underestimates), for example, the tenth-order
(fourth-order) sideband by a factor of 1.9 (1.7). This result implies that EID induces
non-trivial spectral imprints on the HSG spectrum. These modulations result from
the anharmonic electron-hole acceleration identified in Extended Data Fig. 6c.

In general, the SBE approach systematically describes the creation of coherent 
excitons, quasiparticle acceleration, collision-related annihilation, the interplay 
between HSG and HHG contributions, and the influence of many-body effects 
such as EID or phonon scattering processes as suggested by ref. 43. Only some 
aspects of these effects can be quantitatively analysed using either a semiclassical 
approach or a quantum description of a single electron-hole pair. Because many- 
body effects can induce drastic changes to HSG and HHG, it is clear that the SBE 
analysis is needed to properly parameterize any simplified model. 

Influence of excitation fluence. With increasing optical fluence $\Phi$, the measured
sideband intensity increases linearly (Extended Data Fig. 2a); however, the
modulation depth of time-resolved traces of $I_{\mathrm{HSG}}$ is found to decrease (Extended Data
Fig. 2b). This feature may be most intuitively explained by considering that a
growing density of excitons may instantly lead to phase-space filling and broadening,
causing less-well-defined initial conditions for electron-hole recollisions. By
contrast, a potential increase in dephasing rates does not seem to be a dominant factor,
because faster dephasing should result in an increase in the modulation depth, the
opposite effect of what we observe. Additionally, the subcycle delay $\delta_{\mathrm{sc}}$, which is
sensitively dependent on the dephasing time $T_2$, does not change substantially
with varying excitation fluence, implying that major density-driven effects on $T_2$
have not yet built up (Extended Data Fig. 2c). However, such excitation-induced
effects are important for the case of narrowband excitation analysed in Fig. 1 and
Extended Data Fig. 4.

Data landscape of quasiparticle colliders. In regular optical spectroscopy,
system resonances of interest are often embedded within the spectral range of the
applied pump field. Our quasiparticle collider spectrally separates the relevant
features from the pump as distinct sidebands. Additionally, each sideband order
can contain different aspects of the collisions. As in conventional colliders, the
scattering-generated radiation produces a wealth of information about the
(quasi)particles. Here, we study the HSG intensity $I_{\mathrm{HSG}}$ as a function of the HSG frequency
$\nu_{\mathrm{HSG}}$, the field strength $E_\lambda$, the duration $\tau_\lambda$, and the central frequency $\nu_\lambda$ of the
optical preparation pulse ($\lambda$ = opt) and the THz waveform ($\lambda$ = THz), and their
mutual delay $t_{\mathrm{ex}}$, generating at least eight-dimensional datasets:
$I_{\mathrm{HSG}}(\nu_{\mathrm{HSG}}, E_{\mathrm{THz}}, \tau_{\mathrm{THz}}, \nu_{\mathrm{THz}}, E_{\mathrm{opt}}, \tau_{\mathrm{opt}}, \nu_{\mathrm{opt}}, t_{\mathrm{ex}})$.
As is typical for colliders, such a wealth of data reveals
details of colliding quasiparticles only through a dedicated identification process.
For example, Fig. 3 demonstrates how changing $t_{\mathrm{ex}}$ yields a direct detection of the
electron-hole recollision. We will next illustrate a few additional possibilities to
extract exciton-related details from the HSG scattering data.

As shown in Extended Data Fig. 4, broadening of sideband resonances is directly
related to exciton decay rates, which depend on the excitation parameters. This
effect is often referred to as EID, which can be controlled, for example, by
changing $E_{\mathrm{opt}}$ (ref. 34). We have also confirmed that the exciton binding energy $E_B$
strongly influences the overall strength of the HSG signal. For example, a tenfold
increase of $E_B$ amplifies $I_{\mathrm{HSG}}$ by roughly the same amount as a result of Coulomb
enhancement of the collisional cross-section. Hence, a variety of material- and
excitation-dependent exciton features may be deduced by quantitatively analysing
the strength and width of $I_{\mathrm{HSG}}$ resonances.

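By changing the optical pump frequency $\nu_{\mathrm{opt}}$ one can deduce differential
changes in HSG spectra that sensitively monitor material details. We propose
measuring a 2D $I_{\mathrm{HSG}}(\nu_{\mathrm{HSG}}, \nu_{\mathrm{opt}})$ set to follow spectral changes at the second-order
sideband and to define a normalized spectrum

$$I(\nu, \nu_{\mathrm{opt}}) = \frac{I_{\mathrm{HSG}}(\nu + \nu_{\mathrm{opt}}, \nu_{\mathrm{opt}})}{I_{\mathrm{2SB}}}$$

in which $\nu = \nu_{\mathrm{HSG}} - \nu_{\mathrm{opt}}$ is the frequency scale centred around $\nu_{\mathrm{opt}}$ and
$I_{\mathrm{2SB}} = I_{\mathrm{HSG}}(\nu_{\mathrm{opt}} + 2\nu_{\mathrm{THz}}, \nu_{\mathrm{opt}})$ is the peak intensity of the second-order sideband.
This normalization produces spectra that do not depend on the absolute intensity
scale, a quantity that is difficult to determine experimentally for a wide range of
frequencies $\nu_{\mathrm{opt}}$. From the difference of two spectra measured with pump
frequencies $\nu_{\mathrm{opt}} \pm \Delta\nu_{\mathrm{opt}}/2$, we deduce a finite differential

$$\Delta I(\nu, \nu_{\mathrm{opt}}) = I(\nu, \nu_{\mathrm{opt}} - \Delta\nu_{\mathrm{opt}}/2) - I(\nu, \nu_{\mathrm{opt}} + \Delta\nu_{\mathrm{opt}}/2) \approx -\Delta\nu_{\mathrm{opt}} \frac{\partial I}{\partial \nu_{\mathrm{opt}}}$$

which becomes proportional to the derivative in the limit $\Delta\nu_{\mathrm{opt}} \to 0$.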
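
The normalization and the finite differential defined above can be sketched numerically as follows. This is a minimal illustration on synthetic data, not the analysis code used for the measurements: the function names, the linear interpolation onto a common $\nu$ grid and the Gaussian toy sideband are all assumptions introduced here.

```python
import numpy as np

def normalized_spectrum(nu_hsg, intensity, nu_opt, nu_thz):
    """Return (nu, I) with nu = nu_HSG - nu_opt and I = I_HSG / I_2SB."""
    # I_2SB: intensity at the second-order sideband, nu_opt + 2 * nu_THz.
    i_2sb = np.interp(nu_opt + 2.0 * nu_thz, nu_hsg, intensity)
    return nu_hsg - nu_opt, intensity / i_2sb

def differential_spectrum(nu_grid, low, high, nu_thz):
    """Delta I = I(nu, nu_opt - d/2) - I(nu, nu_opt + d/2) on a common nu grid.

    `low` and `high` are (nu_hsg, intensity, nu_opt) tuples recorded at the
    lower and the higher pump frequency, respectively.
    """
    curves = []
    for nu_hsg, intensity, nu_opt in (low, high):
        nu, norm = normalized_spectrum(nu_hsg, intensity, nu_opt, nu_thz)
        curves.append(np.interp(nu_grid, nu, norm))
    return curves[0] - curves[1]

def toy_spectrum(nu_hsg, nu_opt, nu_thz, width):
    """Synthetic stand-in for data: one Gaussian sideband at nu_opt + 2 nu_THz."""
    return 3.0 * np.exp(-((nu_hsg - nu_opt - 2.0 * nu_thz) / width) ** 2)

nu_thz = 22.3                              # THz
nu_grid = np.linspace(0.0, 100.0, 1001)    # common nu axis, THz
low = (nu_grid + 395.0, toy_spectrum(nu_grid + 395.0, 395.0, nu_thz, 5.0), 395.0)
high = (nu_grid + 405.0, toy_spectrum(nu_grid + 405.0, 405.0, nu_thz, 8.0), 405.0)
delta_i = differential_spectrum(nu_grid, low, high, nu_thz)
```

Because each spectrum is divided by its own second-sideband peak, an overall intensity scale factor cancels; in this toy case $\Delta I$ is non-zero only because the sideband width was made to change with the pump frequency.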


Extended Data Fig. 8a, b shows computed $\Delta I(\nu, \nu_{\mathrm{opt}})$ for two 1s-exciton
binding energies, $E_B = 60$ meV and $E_B = 600$ meV, respectively, when $h\nu_{\mathrm{opt}}$ is
expressed in terms of the detuning $\Delta_{\mathrm{opt}}$ with respect to the 1s-exciton energy $h\nu_{1s}$
as $h\nu_{\mathrm{opt}} = h\nu_{1s} + \Delta_{\mathrm{opt}}$. These contours produce spectral information that differs
greatly for the two exciton binding energies, demonstrating that $\Delta I(\nu, \nu_{\mathrm{opt}})$ exposes
material-dependent properties. For $E_B = 60$ meV, $\Delta I(\nu, \nu_{\mathrm{opt}})$ exhibits dominantly
negative changes for $\Delta_{\mathrm{opt}} < 0$, implying that $I_{\mathrm{HSG}}$ decreases for negative
detuning. Because $\Delta_{\mathrm{opt}} > 0$ yields mainly positive $\Delta I(\nu, \nu_{\mathrm{opt}})$, a clear dispersive shape is
observed around the resonant excitation ($\Delta_{\mathrm{opt}} = 0$). For $E_B = 600$ meV, a positive
island appears within the $\Delta_{\mathrm{opt}} < 0$ region, which implies that $I_{\mathrm{HSG}}$ can be enhanced
also for negative $\Delta_{\mathrm{opt}}$, in contrast to the $E_B = 60$ meV case. Additionally, $\Delta_{\mathrm{opt}} > 0$
produces a new feature because the positive islands split into a double-peaked
structure for $E_B = 600$ meV.

To analyse the quantitative dependence of $\Delta I(\nu, \nu_{\mathrm{opt}})$ on $E_B$, Extended Data
Fig. 8c, d shows slices of $\Delta I(\nu, \nu_{\mathrm{opt}})$ at fixed values of $h(\nu - \nu_{\mathrm{2SB}}) = -16$ meV and $+16$ meV,
respectively, for three different binding energies $E_B = 60$ meV (black curve),
240 meV (red curve) and 600 meV (blue curve). The slice at $-16$ meV shows only
one positive peak for $\Delta_{\mathrm{opt}} > 0$ and remains negative at $\Delta_{\mathrm{opt}} < 0$ for $E_B = 60$ meV
(black curve). Increasing the exciton binding energy gradually creates a new
maximum within the region of negative detuning. This transition from a single- to a
double-peaked structure can be exploited to assign the exciton binding energy as
shown below. Monitoring the slices at $h(\nu - \nu_{\mathrm{2SB}}) = +16$ meV reveals a smooth,
narrow, single-peaked structure close to $\Delta_{\mathrm{opt}} = 0$ for $E_B = 60$ meV (black curve).
By contrast, higher binding energies produce a more structured shape.

To construct differential spectra $\Delta I(\nu, \nu_{\mathrm{opt}})$ experimentally, we measure HSG
spectra for narrowband pump pulses at different frequencies $\nu_{\mathrm{opt}}$ (see Fig. 1b, d)
while varying $t_{\mathrm{ex}}$. Extended Data Fig. 9c shows the resulting intensity map
$I_{\mathrm{HSG}}(\nu, \nu_{\mathrm{opt}}, t_{\mathrm{ex}})$ for resonant excitation $h\nu_{\mathrm{opt}} = h\nu_{1s}$; the corresponding
computation is shown in Extended Data Fig. 9e. Both experiment and theory show the
same spectral structure, including $t_{\mathrm{ex}}$-dependent changes in the intensity and
shape. A similar level of agreement is realized also for non-resonant excitations
(not shown), paving the way to thorough theory-experiment comparison of the
differential spectra.
On the basis of Extended Data Fig. 8, we expect the strongest dependence of
$\Delta I(\nu, \nu_{\mathrm{opt}})$ on $E_B$ for a finite detuning $\Delta_{\mathrm{opt}}$ from the 1s-exciton resonance.
We therefore compare experimental traces $\Delta I(\nu, \nu_{\mathrm{opt}})$ at $t_{\mathrm{ex}} = -20$ fs for three
different detunings $\Delta_{\mathrm{opt}} = -75$ meV (shaded area), $-29$ meV (black curve) and
27 meV (red curve) in Extended Data Fig. 9a. A negative $\Delta_{\mathrm{opt}}$ produces a
dispersive line shape in $\Delta I(\nu, \nu_{\mathrm{opt}})$, that is, a dip followed by a zero crossing and a peak,
which implies a shift of the underlying HSG intensity with a change in $\nu_{\mathrm{opt}}$. For
positive $\Delta_{\mathrm{opt}}$, a double-peaked structure indicates a broadening of the HSG peaks.
For a binding energy of $E_B = 60$ meV, these line shapes are well reproduced by the
theoretical computations using the same detuning $\Delta_{\mathrm{opt}}$ and excitation conditions
as in the experiment, as shown in Extended Data Fig. 9b.

Extended Data Fig. 9d, f shows the corresponding calculated differentials
$\Delta I(\nu, \nu_{\mathrm{opt}})$ when the exciton binding energy is increased to $E_B = 240$ meV
and $E_B = 600$ meV, respectively. Whereas $\Delta_{\mathrm{opt}} = -29$ meV (black curve) and
$\Delta_{\mathrm{opt}} = 27$ meV (red curve) produce a similar shape as for $E_B = 60$ meV, the peak
amplitude of the differential curve for $\Delta_{\mathrm{opt}} = -75$ meV (shaded area) greatly
decreases for an intermediate binding energy of $E_B = 240$ meV and turns into a
negative dip for the largest binding energy. This structural change in $\Delta I(\nu, \nu_{\mathrm{opt}})$
can be used to deduce the exciton binding energy. We find that the experimental
data are consistent with $E_B = 60 \pm 20$ meV, which agrees well with values reported
in the literature.
Extended Data Figure 1 | Polarization of high-order sidebands.
a, b, False-colour plot of the spectral intensity of high-order sidebands
(generated under resonant, spectrally narrow, optical excitation) of orders
two to eight as a function of their polarization for horizontal (a) and
vertical (b) polarization of the interband excitation pulse. The polarization
angle is defined such that 0° corresponds to a horizontally polarized
excitation. c, Spectrally integrated sideband intensity as a function of the
Extended Data Figure 2 | Pump-fluence dependence of time-resolved
high-order sideband generation. a, The measured high-order sideband
intensity $I_{\mathrm{HSG}}$ (spectrally and temporally integrated, red spheres) scales
linearly with the pump fluence $\Phi$, as indicated by a guide to the eye
(black dashed line). b, The modulation depth of the spectrally integrated
temporal trace of $I_{\mathrm{HSG}}$ (red spheres, see Fig. 2b) decreases with increasing
pump fluence, closely following a fit (black curve) proportional to the
inverse square-root of the pump fluence. c, By contrast, the relative
subcycle delay $\delta_{\mathrm{sc}} T^{-1}$ does not substantially change with increasing
pump fluence. Red spheres represent the average of $\delta_{\mathrm{sc}} T^{-1}$ over the nine
dominant, consecutive, high-order sideband peaks (see Fig. 2d) for
different pump fluences; error bars indicate their standard deviation.
The black dashed line marks the mean value of the displayed data points.
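
The inverse-square-root fit mentioned in panel b can be reproduced with a one-parameter least-squares estimate. The sketch below is illustrative only: the function name is hypothetical and the numbers are synthetic, since the measured fluences and modulation depths are not tabulated in this text.

```python
import numpy as np

def fit_inverse_sqrt(fluence, depth):
    """Least-squares amplitude a for the model depth = a / sqrt(fluence)."""
    basis = np.asarray(fluence, dtype=float) ** -0.5
    depth = np.asarray(depth, dtype=float)
    # Closed-form solution of min_a sum (depth - a * basis)^2.
    return float(np.dot(basis, depth) / np.dot(basis, basis))

# Synthetic example (arbitrary units), not measured data.
fluence = np.array([1.0, 4.0, 9.0, 16.0])
depth = 0.6 / np.sqrt(fluence)
amplitude = fit_inverse_sqrt(fluence, depth)
```

With a single free amplitude the fit has a closed-form solution, so no iterative optimizer is needed.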
Extended Data Figure 3 | Coherent excitonic polarization dynamics
in k space. a, Computed high-order sideband intensity $I_{\mathrm{HSG}}$ (red shaded
area) and driving waveform (blue). Vertical black dotted lines highlight
characteristic delay times $t_{\mathrm{ex}}$ at extrema of $I_{\mathrm{HSG}}$. b-d, Coherent interband
polarization $|p_k|^2$ (colour scale) as a function of time $t$ and reciprocal space
coordinate $k$ (in units of the wave vector $k_{\mathrm{BZ}}$ limiting the first Brillouin
zone) for distinct delay times $t_{\mathrm{ex}}$ according to maximum (b, $t_{\mathrm{ex}} = 7$ fs;
d, $t_{\mathrm{ex}} = 28$ fs) and minimum (c, $t_{\mathrm{ex}} = 18$ fs) HSG emission. Horizontal
white lines mark $k = 0$. The driving field is depicted as grey curves and
the recollision times (see Fig. 3d) are highlighted by black arrows.
Extended Data Figure 4 | Influence of dephasing on high-order
sideband generation. a, b, The measured sideband spectrum (shaded
area) is compared with computations using constant dephasing times
of $T_2 = 3.2$ fs (a, red curve), $T_2 = 4$ fs (b, blue curve) and a
momentum-dependent dephasing model as presented in Fig. 1 (black curves; EID,
excitation-induced dephasing). The sideband orders $n$ are indicated
above the relevant peaks. All spectra are normalized to the sideband
peak corresponding to $n = 2$. The inset in a depicts the corresponding
dephasing times $T_2(k)$ as a function of the wave vector $k$; the red (blue)
horizontal line indicates a constant decay level $T_2 = 3.2$ fs ($T_2 = 4$ fs).
Extended Data Figure 5 | Field scaling of time-resolved high-order
sideband generation. a, Scaling of the modulation depth (see discussion
in Methods section ‘Influence of excitation fluence’) of temporal traces
of high-order sideband intensity with the driving peak field strength.
An increase in the modulation depth suggests a faster dephasing $T_2$.
b, Subcycle delay $\delta_{\mathrm{sc}}$ in units of the driving period $T$ averaged over the
eight most-dominant sideband peaks as a function of the peak field.
The error bars represent the standard deviation of the eight peaks and the
dashed line depicts a linear fit to the data points.
Extended Data Figure 6 | Influence of excitation-induced dephasing.
a, Terahertz waveform used in the computations. b, c, Mean electron
excursion $\langle k \rangle$ calculated with a constant dephasing $T_2 = 12$ fs (b) and
excitation-induced dephasing (c) for a delay of $t_{\mathrm{ex}} = 7$ fs (see vertical dotted
lines in a). The shaded areas indicate the intensity envelopes of the
excitation pulse with a full-width at half-maximum of 10 fs (yellow) and
100 fs (grey). The corresponding mean momenta $\langle k \rangle$ are shown as red
and black curves, respectively. The red dashed curves show $\langle k \rangle$ after
the electron-hole collision. The dashed horizontal lines in c mark the
positions in k space where the scattering time $T_2(k)$ switches from slow to
fast dephasing.


© 2016 Macmillan Publishers Limited. All rights reserved 




[Plot: spectral intensity on a logarithmic scale versus Frequency (THz), 300 to 1000 THz; curves for φ = 90° and φ = 120°]

Extended Data Figure 7 | High-order harmonic generation in tungsten
diselenide. An intensity spectrum (blue shaded area) of THz-driven
high-order harmonic generation in 60-nm-thick WSe₂ shows distinct peaks at
odd orders n′ = 13 to n′ = 47 (indicated by numerals) of the fundamental
frequency $\nu_{\mathrm{THz}} = 22.3$ THz (peak electric field in air, 21 MV cm⁻¹;
φ = 90°). An intensity spectrum for φ = 120° is also shown as a black
curve. The spectral intensity has been corrected for the grating efficiency
and for the quantum efficiency of the spectrograph used.
Extended Data Figure 8 | Differential sideband spectroscopy.
a, b, Contours of differential spectra $\Delta I(\nu, \nu_{\mathrm{opt}})$ (colour scale) as a
function of $h(\nu - \nu_{\mathrm{2SB}})$ and detuning $\Delta_{\mathrm{opt}}$ computed for an excitonic
binding energy of $E_B = 60$ meV (a) and $E_B = 600$ meV (b). The black
and grey vertical lines mark the positions of the slices shown in c and d,
respectively. c, d, Snapshots of $\Delta I(\nu, \nu_{\mathrm{opt}})$ at fixed values of $|h(\nu - \nu_{\mathrm{2SB}})| =
16$ meV below (c) and above (d) the second-order sideband peak for three
different binding energies $E_B = 60$ meV (black curves), $E_B = 240$ meV
(red curves) and $E_B = 600$ meV (blue curves).
a, Measured differential spectra $\Delta I(\nu, \nu_{\mathrm{opt}})$ for three different detunings
$\Delta_{\mathrm{opt}} = -75$ meV (shaded area), $\Delta_{\mathrm{opt}} = -29$ meV (black curve) and
$\Delta_{\mathrm{opt}} = 27$ meV (red curve) as functions of $\nu$, centred at the position of the
second sideband $\nu_{\mathrm{2SB}}$. b, d, f, Computed differential spectra $\Delta I(\nu, \nu_{\mathrm{opt}})$ for
Site-selective and stereoselective functionalization of unactivated C-H bonds

Kuangbiao Liao¹, Solymar Negretti¹, Djamaladdin G. Musaev², John Bacsa¹ & Huw M. L. Davies¹


The laboratory synthesis of complex organic molecules relies
heavily on the introduction and manipulation of functional
groups, such as carbon-oxygen or carbon-halogen bonds;
carbon-hydrogen bonds are far less reactive and harder to functionalize
selectively. The idea of C-H functionalization, in which C-H bonds
are modified at will instead of the functional groups, represents a
paradigm shift in the standard logic of organic synthesis. For
this approach to be generally useful, effective strategies for
site-selective C-H functionalization need to be developed. The most
practical solutions to the site-selectivity problem rely on either
intramolecular reactions or the use of directing groups within the
substrate. A challenging, but potentially more flexible, approach
would be to use catalyst control to determine which site in a
particular substrate would be functionalized. Here we describe
the use of dirhodium catalysts to achieve highly site-selective,
diastereoselective and enantioselective C-H functionalization
of n-alkanes and terminally substituted n-alkyl compounds. The
reactions proceed in high yield, and functional groups such as
halides, silanes and esters are compatible with this chemistry.
These studies demonstrate that high site selectivity is possible in
C-H functionalization reactions without the need for a directing
or anchoring group present in the molecule.

We have demonstrated that catalyst-controlled site selectivity is
possible in the dirhodium-catalysed intermolecular reactions of
donor/acceptor carbenes with relatively activated C-H bonds, such as those at
benzylic and allylic positions and α to oxygen. The carbene-induced
C-H functionalization is initiated by a hydride transfer event, and
consequently the reaction is favoured at sites capable of stabilizing
a build-up of positive charge; thus, tertiary C-H bonds are
electronically preferred (Fig. 1a). However, the dirhodium-carbene
Figure 1 | Site-selective C-H functionalization by donor/acceptor
carbenes. a, Site-selectivity trend in dirhodium-catalysed C-H
functionalization by donor/acceptor carbenes is controlled by a delicate
balance between steric and electronic effects. b, Previous studies showed
that site selectivity at activated C-H bonds could be controlled by
the nature of the catalyst. Rh₂(S-DOSP)₄ prefers functionalization at
secondary and tertiary C-H bonds, whereas Rh₂(R-p-PhTPCP)₄ prefers
functionalization at primary C-H bonds. c, This study demonstrates
that highly site-selective and stereoselective C-H functionalization
of unactivated C-H bonds can be achieved with the new catalyst
Rh₂[R-3,5-di(p-ᵗBuC₆H₄)TPCP]₄.
a
R group      Catalyst   Name
p-Ph         2          Rh₂(R-p-PhTPCP)₄
p-Br         3          Rh₂(R-p-BrTPCP)₄
p-ᵗBu        4          Rh₂(R-p-ᵗBuTPCP)₄
p-NO₂        5          Rh₂(R-p-NO₂TPCP)₄
p-CF₃        6          Rh₂(R-p-CF₃TPCP)₄
3,5-diCF₃    7          Rh₂(R-3,5-di-CF₃TPCP)₄
3,5-diBr     8          Rh₂(R-3,5-di-BrTPCP)₄

b
Complex 8 + ArB(OH)₂ (16 equiv.), Pd(dppf)Cl₂, K₃PO₄ → catalysts 9-15 (60-90%)

R group      Catalyst   Name
H            9          Rh₂(R-3,5-di-PhTPCP)₄
3,5-diCF₃    10         Rh₂[R-3,5-di(3,5-diCF₃C₆H₃)TPCP]₄
3,5-diCH₃    11         Rh₂[R-3,5-di(3,5-diCH₃C₆H₃)TPCP]₄
3,5-diPh     12         Rh₂[R-3,5-di(3,5-diPhC₆H₃)TPCP]₄
p-Ph         13         Rh₂[R-3,5-di(p-PhC₆H₄)TPCP]₄
p-CF₃        14         Rh₂[R-3,5-di(p-CF₃C₆H₄)TPCP]₄
p-ᵗBu        15         Rh₂[R-3,5-di(p-ᵗBuC₆H₄)TPCP]₄


Figure 2 | Synthesis of TPCP carboxylate dirhodium catalysts. 

a, Original synthesis of TPCP carboxylate dirhodium catalysts. 

b, New synthesis of TPCP carboxylate dirhodium catalysts using 
palladium-catalysed cross-coupling on a preformed dirhodium complex. 
See Supplementary Information for the synthetic details. 


complex is sterically demanding, so on steric grounds the primary
C-H bond would be preferred. We have shown that by altering the
steric nature of the catalyst, the site selectivity of C-H
functionalization can be dictated in these activated substrates from a
preference for the secondary or tertiary C-H bonds using the established
dirhodium catalyst Rh₂(S-DOSP)₄ (1), to the primary C-H bond
with the more sterically encumbered triarylcyclopropanecarboxylate
catalyst Rh₂(R-p-PhTPCP)₄ (2) (Fig. 1b)¹²,¹³. Even though
intermolecular C-H functionalization with metal carbene
intermediates has been a longstanding challenge, the
metal-catalysed reactions of n-alkanes with the more traditional acceptor carbene
derived from ethyl diazoacetate have been explored, furnishing mixtures
of products, controlled to some extent by the nature of the catalyst, with
primary C-H insertion preferred when employing larger and more
electrophilic catalysts. The reaction of n-alkanes with the
donor/acceptor carbene derived from methyl phenyldiazoacetate showed even
greater preference for primary C-H insertion when a bulky rhodium
porphyrin catalyst was used. We describe a new
triarylcyclopropanecarboxylate catalyst, Rh₂[R-3,5-di(p-ᵗBuC₆H₄)TPCP]₄ (15), that can
achieve highly regio-, diastereo- and enantioselective C-H
functionalization at the unactivated C2 position of n-alkanes or terminally
substituted n-alkyl compounds (Fig. 1c).
Scheme: 16 (Ar = p-BrC₆H₄, R′ = CH₂CCl₃) + pentane, Rh₂ catalyst (1 mol%) → 17 + 18 + 19

Rh₂        Site selectivity   Diastereoselectivity   Enantioselectivity   Combined
catalyst   (17:18:19 ratio)   (d.r. for 18)          (e.e.% for 18)       yield (%)
1          n.d.:29:1          3:1                    82                   98
2          1:2:n.d.           14:1                   92                   97
3          1:4:n.d.           6:1                    91                   98
4          1:5:n.d.           15:1                   92                   95
5          13:36:1            3:1                    79                   82
6          1:7:n.d.           ?:1                    94                   87
7          1:11:n.d.          8:1                    81                   92
8          1:26:n.d.          5:1                    78                   95
9          1:22:n.d.          6:1                    96                   95
10         1:26:n.d.          26:1                   92                   96
11         1:16:n.d.          29:1                   99                   95
12         1:5:n.d.           16:1                   97                   98
13         1:9:n.d.           10:1                   89                   91
14         1:30:n.d.          8:1                    91                   95
15         1:25:n.d.          20:1                   99                   99

Figure 3 | Catalyst optimization studies. Evaluation of the dirhodium
TPCP catalysts for the C-H functionalization of pentane. The
optimum catalyst for high site selectivity at C2, diastereoselectivity,
enantioselectivity and yield is catalyst 15, Rh₂[R-3,5-di(p-ᵗBuC₆H₄)TPCP]₄.
The standard reaction was carried out in refluxing pentane with
1 mol% catalyst loading. See Supplementary Information for experimental
details. n.d., not determined.


Having discovered that the triphenylcyclopropane carboxylate
(TPCP) catalysts can exert considerable influence on the site selectivity
of donor/acceptor carbenes with activated substrates for C-H
functionalization, we decided to explore whether these catalysts could
achieve selective intermolecular C-H functionalization of unactivated
C-H bonds. To initiate this project, we prepared a series of chiral
dirhodium triarylcyclopropane carboxylate catalysts. The first series
of dirhodium catalyst derivatives, 2-8, was prepared by our standard
method, which involves asymmetric cyclopropanation, hydrolysis of
the resulting ester to the carboxylic acid, enantiomer enrichment, and
ligand exchange (Fig. 2a). When it became clear that catalysts 7 and
8 with a 3,5-disubstituted phenyl group had promising properties (see
later), we developed a new approach for catalyst diversification by
conducting an eightfold palladium-catalysed cross-coupling reaction on
the fully formed 3,5-dibromo complex 8 to generate a second series of
catalysts (9-15) (Fig. 2b). The new approach has broad utility because
the catalysts are made from a single enantiomerically pure complex,
avoiding the variable yields observed in ligand exchange reactions
when the ligands become sterically congested and streamlining the
whole process.

The initial evaluation of the different catalysts was conducted using
n-pentane as the test substrate. Even though n-pentane is a simple
substrate, the challenge of C-H functionalization of this system is
apparent when one considers the subtlety required in achieving a
site-selective reaction. Recent studies have shown that
donor/acceptor carbene reactions tend to proceed better when the trichloroethyl
ester is used instead of the methyl ester. Hence, 2,2,2-trichloroethyl
2-(4-bromophenyl)-2-diazoacetate (16) was used as the carbene
precursor for the initial evaluation (Fig. 3). The N-sulfonylprolinate
Rh₂(R-DOSP)₄ (1) is the standard chiral catalyst that has been used
for most of the C-H functionalization chemistry of donor/acceptor
carbenes, but its reaction with n-alkanes has not been reported.
Therefore, it was promising to observe that when Rh₂(R-DOSP)₄
was used as the catalyst, 16 reacted with pentane to give only 18 and
19. The products derived from insertion into methylene C-H bonds
were observed in a high overall yield and good site selectivity (29:1
a  Modification of ester R′
Compound   r.r.    d.r.    e.e.   Yield
20         18:1    8:1     92%    89%

b  Modification of aryl group R
Compound   r.r.    d.r.    e.e.   Yield
23         15:1    14:1    97%    91%
Figure 4 | C-H functionalization of alkanes and substituted
n-alkanes. a-c, Evaluation of the scope of Rh₂[R-3,5-di(p-ᵗBuC₆H₄)TPCP]₄-catalysed
C-H functionalization of pentane and substituted
n-alkanes. a, Compounds 20-22 illustrate the effect of the ester group.
b, Compounds 23-25 illustrate the effect of modifications to the donor


regioisomeric ratio (r.r.) of C2 to C3). The enantiocontrol was also
reasonable (82% enantiomeric excess (e.e.)), but the diastereocontrol was
moderate (3:1 diastereomeric ratio (d.r.)). Even though the reaction
displayed exceptional site selectivity considering that both reacting
sites are methylene sites with only a slight difference in steric
environment, we recognized that the competition with the internal methylene
sites would probably lead to complex mixtures when longer n-alkanes
were used as substrates. Therefore, we examined the reaction with
the triarylcyclopropanecarboxylate catalysts, as previous studies had
shown that these catalysts behave as more sterically encumbered
catalysts and drive the C-H functionalization site selectivity towards the
less sterically hindered positions. This expected trend was followed
by the p-substituted catalysts 2-6, which generated a considerable
amount of the C1 insertion product 17 in addition to the C2 product 18 (1:3 to
1:7 ratio), without any C3 insertion product 19 being formed. The
diastereo- and/or enantioselectivity of the C-H functionalization with
catalysts 2-6 is influenced by the nature of the aryl substituent on the
ligand, with the p-Ph derivative 2 giving the highest level of
enantioselectivity (92% e.e.) and the p-tert-butyl derivative 4 giving the highest
diastereoselectivity (15:1 d.r.). When the reaction was extended to the
3,5-di-CF₃-substituted catalyst 7, we expected increased formation
of the C1 insertion product. However, this was not the case, and a
strong preference for C2 over C1 functionalization (11:1 ratio) was
Panel   Compound   r.r.    d.r.    e.e.    Yield
a       21         30:1    15:1    99%     91%
a       22         19:1    21:1    98%     90%
b       24         22:1    24:1    >99%    91%
b       25         28:1    55:1    91%     87%
c       27         18:1    9:1     97%     85%
c       28         18:1    9:1     93%     84%
c       30         9:1     9:1     95%     65%
c       31         34:1    4:1     >99%    40%
c       33         ?       7:1     92%     83%
c       34         16:1    9:1     91%     84%


group. c, Compounds 26-34 illustrate the effect of the alkyl substituent. 
Reactions with pentane were conducted in pentane as solvent. Reactions 
with substituted alkanes were conducted with 3 equiv. of the substituted 
alkane with dichloromethane as solvent. See Supplementary Information 
for complete experimental details. 


observed without any formation of the C3 functionalized product 19.
Furthermore, the diastereoselectivity and enantioselectivity for the
formation of 18 were quite high (8:1 d.r., 81% e.e.). A similar effect was
also seen with the 3,5-di-Br-substituted catalyst 8, which gave an even
higher C2 to C1 ratio (26:1), but slightly lower levels of
diastereoselectivity and enantioselectivity (5:1 d.r., 78% e.e.). On the basis of these
results, the study was extended to the 3,5-diaryl-substituted catalysts
9-15, prepared from the cross-coupling strategy on the dirhodium
complex 8. Many of these catalysts gave a strong preference for C-H
functionalization at C2 over C1 (up to 30:1), some generated the C2
product 18 with exceptional diastereoselectivity (up to 29:1 d.r.), and
all gave high levels of enantioselectivity (89-99% e.e.). The optimum
catalyst in terms of overall performance was the
3,5-di(p-tert-butylphenyl)phenyl-derived catalyst 15, Rh₂[R-3,5-di(p-ᵗBuC₆H₄)TPCP]₄,
which gave the C2 product 18 in 99% overall yield with high site
selectivity favouring C2 over C1 (25:1 ratio), diastereoselectivity (20:1 d.r.)
and enantioselectivity (99% e.e.).

The Rh2[R-3,5-di(p-tBuC6H4)TPCP]4-catalysed reactions with
pentane were then examined with a variety of donor/acceptor 
carbene precursors (Fig. 4a, b). All of the reactions proceeded in high 
yield (87-91%) and with high levels of enantioselectivity (91% to 
>99% e.e.). The first three substrates were chosen to examine the 
nature of the ester substituent (Fig. 4a). The methyl ester derivative 
was quite effective in the C-H functionalization to form 20, but overall
its performance in terms of yield and selectivity was lower than that of
the trichloroethyl ester 16. Larger alkyl esters are best avoided when 
attempting intermolecular C-H functionalization reactions, as intra- 
molecular C-H functionalization will tend to compete. However, the 
trifluoroethyl and tribromoethyl esters are effective ester groups and
perform comparably to the trichloroethyl derivatives to form 21 and
22. In specific cases, the trifluoroethyl group may be worth considering
further because the site selectivity for C2 for 21 is better than with the
trichloroethyl derivative (30:1 versus 25:1 r.r.), although the diastereoselectivity
is lower (15:1 versus 20:1 d.r.). The reaction was extended
to a series of phenyl-substituted carbenes (Fig. 4b). They all performed 
well in these reactions, but the site selectivity and diastereoselectivity 
were somewhat reduced for the p-trifluoromethyl derivative 23 
(14:1 C2 versus C1, 15:1 d.r.). The potential breadth of utility of the 
C-H functionalization is illustrated with a pyridine system, which is 
an effective substrate for this chemistry. Indeed, the C-H functional- 
ization product 25 was produced with the highest diastereoselectivity 
to date (55:1 d.r.). 
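
The selectivities quoted above are ratios, and it can help to read them as fractions of product. The following Python sketch is an illustration only and is not part of the reported work: the function names are hypothetical, and the inputs are values quoted in the text (25:1 r.r., 20:1 d.r. and 99% e.e. for product 18 with catalyst 15, and 55:1 d.r. for product 25).

```python
def major_fraction(ratio: float) -> float:
    """Fraction of the major isomer for a major:minor ratio of `ratio`:1."""
    if ratio <= 0:
        raise ValueError("ratio must be positive")
    return ratio / (ratio + 1.0)


def enantiomer_ratio(ee_percent: float) -> float:
    """Major:minor enantiomer ratio implied by an e.e. given in percent."""
    if not 0 <= ee_percent < 100:
        raise ValueError("e.e. must be in the range [0, 100)")
    return (100.0 + ee_percent) / (100.0 - ee_percent)


if __name__ == "__main__":
    # Values quoted in the text for product 18 formed with catalyst 15.
    print(f"25:1 r.r. -> {major_fraction(25):.1%} C2 product")
    print(f"20:1 d.r. -> {major_fraction(20):.1%} major diastereomer")
    print(f"99% e.e.  -> {enantiomer_ratio(99):.0f}:1 enantiomer ratio")
    # Value quoted in the text for the pyridine-derived product 25.
    print(f"55:1 d.r. -> {major_fraction(55):.1%} major diastereomer")
```

Read this way, a 25:1 site selectivity still leaves a few per cent of the C1 product, which is why the ratios for different catalysts and substrates are worth comparing on the same scale.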

As the selectivity competition is between the C2 and C1 positions, 
the ratio would be expected to remain about the same for longer- 
chain alkanes. This was indeed the case for the reaction with n-octane, 
which generated 26 with a site selectivity of 27:1 r.r., slightly enhanced
compared with pentane (Fig. 4c). One of the challenges of C-H func- 
tionalization is to conduct the reactions in the presence of functional 
groups. Having established that Rh2[R-3,5-di(p-tBuC6H4)TPCP]4 is
an exceptional catalyst for C-H functionalization of n-pentane, we 
performed an initial evaluation to determine if the reaction can be 
conducted in the presence of terminally substituted n-alkyl compounds, 
such as alkyl halides, silanes and esters (Fig. 4c). These reactions were 
conducted with the bromophenyl derivative 16 and 3 equiv. of the 
substrate in dichloromethane as solvent under reflux conditions. 
Alkyl halides are versatile functional groups in organic synthesis, but as
the dirhodium carbene is highly electrophilic, we considered whether
they would be compatible with the C-H functionalization chemis- 
try. Indeed, 1-bromo-, 1-chloro- and 1-fluorohexane were found to 
be good substrates for the C-H functionalization. In all three cases, 
the levels of enantioselectivity for the formation of 27-29 were high 
(92-97% e.e.), the site selectivity for the methylene C-H bond was 18:1 
but the diastereoselectivity was somewhat diminished (9:1 d.r.). The 
reaction with 1-bromopentane to form 30 also proceeded with high 
enantioselectivity (95% e.e.) but the yield was only 65% and the site 
selectivity was only 9:1 r.r. This result may indicate that the bromine is 
displaying a long-range inductive effect, which slightly deactivates the 
C2 position. Such a characteristic could become a useful controlling 
element in more complex systems. n-Alkyl silanes were considered
to be interesting substrates because the C-H bonds β
to silicon should be electronically activated towards C-H functionalization.
Nevertheless, the steric influence dominates in these reactions, and
preferred formation of the regioisomers 31 and 32 was observed once
again. Another functional group that is compatible with this chemistry 
is an ester, as illustrated in the formation of 33 and 34, again with 
strong preference for functionalization of the methylene C-H bond. 
These studies demonstrate that the Rh2[R-3,5-di(p-tBuC6H4)TPCP]4-catalysed
reactions of donor/acceptor carbenes have a strong preference
for functionalization of the methylene site at the C2 position
of alkanes and terminally substituted alkanes. Presumably, the C-H 
functionalization at the methyl group is less favoured on electronic 
grounds, whereas the steric environment around the catalyst is suffi- 
cient to distinguish between the methylene sites. All of the reactions 
proceed with high enantioselectivity (90% to >99% e.e.), but the dias- 
tereoselectivity is variable (4:1-9:1 d.r.). 

Further evidence to help understand why Rh2[R-3,5-di(p-tBuC6H4)TPCP]4
is such an effective catalyst was obtained from computational
and X-ray crystallographic studies. We and others have previously 
shown that the presence of four identical chiral ligands around the 
Figure 5 | Structural information about the dirhodium catalysts.
a, Computational structure of the α,β,α,β form of Rh2[S-3,5-diPhTPCP]4.
b, Computational structure of the α,α,α,α form of
Rh2[S-3,5-diPhTPCP]4 (5.0 kcal mol⁻¹ less stable than the α,β,α,β form).
c, X-ray crystal structure of Rh2[S-3,5-di(p-tBuC6H4)TPCP]4, in which
the two 3,5-di(p-tBuC6H4)C6H3 groups on the top face are highlighted
in yellow. For clarity, the diethyl ether molecules that were coordinated
to the rhodium in the crystal structure have been removed. See the
Supplementary Information for complete experimental details.


dirhodium core can result in a catalyst with higher symmetry than
the ligands themselves'*-?*. Computational studies on Rh2[S-3,5-diPhTPCP]4
revealed that the 3,5-disubstituted pattern disfavours
the existence of two adjacent 3,5-diphenylphenyl groups on the
same face of the dirhodium catalyst. Hence the preferred orientation
of the catalyst is the α,β,α,β-orientation (Fig. 5a), which is
favoured over the α,α,α,α-orientation (Fig. 5b) by 5.0 kcal mol⁻¹.
The results from our computational studies were consistent with data 
from X-ray crystallographic analysis. An X-ray crystal structure of 
15 as the corresponding bis(diethyl ether) complex was solved, and 
the complex was found to adopt the α,β,α,β-orientation (Fig. 5c).
This arrangement is D2 symmetric, which causes both faces of the 
dirhodium catalyst to be the same, and limits the number of possible 
orientations of the carbene when it binds to the dirhodium core. 
The C-H functionalization has been proposed to proceed via a 
concerted asynchronous mechanism in which the hydrogen of the 
C-H bond first approaches the carbene site”’. It has been previously 
demonstrated that when C-H insertion into methylene C-H bonds 
occurs, the reactions can be highly diastereoselective as long as there 
is considerable size differentiation between the two other groups at the 
methylene site'*. The size difference between a methyl and an n-propyl 
group, as in the case of pentane, would not be expected to be sufficient 
to cause high levels of diastereoselectivity using the more traditional 
chiral dirhodium tetracarboxylate catalysts, but when the bulky TPCP 
catalysts are used, high diastereoselectivity is also possible. 
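
To give a sense of scale for the computed 5.0 kcal mol⁻¹ preference for the α,β,α,β form, the sketch below converts an energy gap into a two-state Boltzmann population ratio. It is an illustration under stated assumptions, not a calculation from the study: the gap is treated as a free-energy difference, only two conformers are considered, and the temperature of 298.15 K is an assumed value. The function names are hypothetical.

```python
import math

R_KCAL_PER_MOL_K = 1.987204e-3  # molar gas constant in kcal mol^-1 K^-1


def boltzmann_ratio(delta_e_kcal: float, temperature_k: float = 298.15) -> float:
    """Population ratio (lower-energy : higher-energy form) for two states."""
    if temperature_k <= 0:
        raise ValueError("temperature must be positive")
    return math.exp(delta_e_kcal / (R_KCAL_PER_MOL_K * temperature_k))


def minor_population(delta_e_kcal: float, temperature_k: float = 298.15) -> float:
    """Fraction of molecules in the higher-energy form (two-state model)."""
    return 1.0 / (1.0 + boltzmann_ratio(delta_e_kcal, temperature_k))


if __name__ == "__main__":
    gap = 5.0  # kcal mol^-1, the value quoted in the text
    print(f"favoured : disfavoured ratio = {boltzmann_ratio(gap):.0f} : 1")
    print(f"disfavoured fraction = {minor_population(gap):.2e}")
```

Under these assumptions the disfavoured form would account for well under 0.1% of the population, which is consistent with the crystal structure showing only the α,β,α,β arrangement.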

These studies demonstrate that highly site-selective C-H function- 
alization of unactivated C-H bonds at C2 of alkanes and terminally 
substituted alkanes is a viable process. The reactions proceed in high 
yield and functional groups such as halides, silanes and esters are compatible
with this chemistry. A new class of D2-symmetric dirhodium
catalysts was developed that is capable of achieving not only high
site selectivity in these C-H functionalization reactions, but also high 
levels of diastereocontrol and enantiocontrol. These results show that 
high site selectivity is possible in C-H functionalization reactions 
without the need for a directing or anchoring group present in the 
molecule. The demonstration that it is possible to design catalysts 
with defined chiral pockets to control not only the enantioselectivity, 
but also the diastereo- and site selectivity of carbene-induced C-H 
functionalization, could have broad implications for future research
directions in selective C-H functionalization.
Ancient micrometeorites suggestive of an
oxygen-rich Archaean upper atmosphere 


Andrew G. Tomkins, Lara Bowlt, Matthew Genge, Siobhan A. Wilson, Helen E. A. Brand & Jeremy L. Wykes


It is widely accepted that Earth’s early atmosphere contained less 
than 0.001 per cent of the present-day atmospheric oxygen (O2) 
level, until the Great Oxidation Event resulted in a major rise in 
O2 concentration about 2.4 billion years ago¹. There are multiple
lines of evidence for low O2 concentrations on early Earth, but 
all previous observations relate to the composition of the lower 
atmosphere² in the Archaean era; to date no method has been
developed to sample the Archaean upper atmosphere. We have 
extracted fossil micrometeorites from limestone sedimentary rock 
that had accumulated slowly 2.7 billion years ago before being 
preserved in Australia’s Pilbara region. We propose that these 
micrometeorites formed when sand-sized particles entered Earth’s 
atmosphere and melted at altitudes of about 75 to 90 kilometres 
(given an atmospheric density similar to that of today³). Here we
show that the FeNi metal in the resulting cosmic spherules was 
oxidized while molten, and quench-crystallized to form spheres 
of interlocking dendritic crystals primarily of magnetite (Fe3O4),
with wüstite (FeO) + metal preserved in a few particles. Our model
of atmospheric micrometeorite oxidation suggests that Archaean 
upper-atmosphere oxygen concentrations may have been close to 
those of the present-day Earth, and that the ratio of oxygen to carbon 
monoxide was sufficiently high to prevent noticeable inhibition 
of oxidation by carbon monoxide. The anomalous sulfur isotope 
(Δ³³S) signature of pyrite (FeS2) in seafloor sediments from this
period, which requires an anoxic surface environment⁴, implies that
there may have been minimal mixing between the upper and lower 
atmosphere during the Archaean. 

The low concentration of O2 in Earth’s lower atmosphere during
the Archaean period (3.9-2.5 Ga) has been demonstrated through 
(1) preservation of detrital pyrite and uraninite in ancient sediments 
derived from weathering of rocks on land (these weather rapidly in 
the presence of oxygen⁵), (2) a lack of oxidized iron in palaeosols⁶,
and most definitively, (3) the strong mass-independent fractionation of 
sulfur isotopes recorded in seafloor pyrite’. In addition, large volumes 
of banded iron formations in late Archaean sedimentary rock sequences 
require high concentrations of dissolved Fe²⁺ to have existed in the
Archaean oceans, which buffered atmospheric and oceanic O2 to very
low concentrations⁷. After the evolution of photosynthesizing bacteria
(possibly at ~2.7 Ga; refs 8, 9), this balance was maintained while there
was sufficient dissolved Fe²⁺ in seawater, which was probably facilitated
by particularly active volcanism in the period 2.7-2.4 Ga (ref. 7).
The highly variable Δ³³S signature of pyrite (FeS2) in carbonaceous
shales from this period is evidence that volcanically released SO2 was 
dissociated by ultraviolet radiation in the atmosphere and then never 
re-oxidized; this requires the O2 concentration to have been <0.001% 
of the present atmospheric level⁴. The Great Oxidation Event is thought
to have occurred when the rate of O2 production outstripped the rate of
its removal via iron oxide sedimentation¹, as a consequence of increased
bacterial colonization and decreased volcanic activity¹⁰. After 2.4 Ga
the Δ³³S of pyrite is invariant, indicating that atmospheric O2 had
significantly increased¹¹.

Sulfur has been injected continuously into Earth’s atmosphere 
by volcanic eruptions over geological time. In the case of the largest 
ultraplinian eruptions, ejection columns can reach the upper strato- 
sphere, rising to heights of 50 km (ref. 12). Today, volcanic material 
from a single eruption may remain in the stratosphere for years because 
there is little mixing within this layer¹³. On the other hand, micrometeorites
entering the modern atmosphere experience their peak temperature
at 75-90 km altitude¹⁴ as they are decelerated from velocities
exceeding 12 km s⁻¹ between 80 km and 150 km altitude¹⁵. For those
micrometeorites that were sufficiently heated to melt completely, their 
high surface area results in quench crystallization as they cool at the end 
of deceleration. (We note that 70%-90% of modern micrometeorites
with diameters >100 µm are completely melted, whereas ~22% of those
25-50 µm in size are fully or partially melted¹⁶.) Because small micrometeorites
are melted and quench-crystallized within two seconds,
modern micrometeorites only chemically interact with higher levels 
of the atmosphere. Therefore, we suggest that fossil micrometeorites 
trapped in sedimentary rocks represent a previously untapped, long- 
term record of the chemical composition of Earth’s upper atmosphere. 

To obtain fossil micrometeorites that are older than the oldest so far 
found (which fell 1.8 Gyr ago¹⁷), we sampled limestone layers from the
Meentheena Member from the Pilbara region of northwest Australia. 
The 30-50-m thick Meentheena Member is part of the Tumbiana 
Formation in the Mount Bruce Supergroup of the Hamersley Basin¹⁸;
its age is 2,721 ± 4 Ma (ref. 19). We chose this unit because (1) it is barely
affected by deformation and metamorphism, (2) its sediments were 
deposited slowly (a deposition rate of ~45 m Myr⁻¹; ref. 20), thereby
allowing greater accumulation of micrometeorites, and (3) limestone 
is easily dissolved with acid, allowing rapid extraction of microme- 
teorites from large samples. We preferentially selected samples from 
surface outcrops with fine horizontal laminations (from 21° 17′ 43.8″
S, 120° 27′ 28.7″ E; Extended Data Fig. 1) because these imply a slow
deposition rate in a calm, deeper-water setting. 
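
As a rough check on what "deposited slowly" means for sampling, the sketch below turns the quoted deposition rate (~45 m Myr⁻¹) and member thickness (30-50 m) into time spans. It is an illustration only: it assumes a constant deposition rate across the whole member, which the text does not claim, and the function names are hypothetical.

```python
DEPOSITION_RATE_M_PER_MYR = 45.0  # approximate rate quoted in the text (ref. 20)


def accumulation_time_myr(thickness_m: float,
                          rate_m_per_myr: float = DEPOSITION_RATE_M_PER_MYR) -> float:
    """Time in Myr needed to deposit `thickness_m` at a constant rate."""
    if rate_m_per_myr <= 0:
        raise ValueError("rate must be positive")
    return thickness_m / rate_m_per_myr


def years_per_centimetre(rate_m_per_myr: float = DEPOSITION_RATE_M_PER_MYR) -> float:
    """Years represented by one centimetre of sediment at a constant rate."""
    if rate_m_per_myr <= 0:
        raise ValueError("rate must be positive")
    metres_per_year = rate_m_per_myr / 1.0e6
    return 0.01 / metres_per_year


if __name__ == "__main__":
    for thickness in (30.0, 50.0):  # thickness range quoted for the member
        print(f"{thickness:.0f} m -> {accumulation_time_myr(thickness):.2f} Myr")
    print(f"1 cm of limestone -> about {years_per_centimetre():.0f} years")
```

On these assumptions each centimetre of limestone integrates a couple of centuries of micrometeorite infall, which is the reason a slowly deposited unit concentrates more particles per sample.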

Sixty micrometeorites, 8.6-50 µm in diameter, were separated from
three limestone samples and examined by microanalytical techniques 
(Methods). All extracted micrometeorites are cosmic spherules — 
meteoritic material that was fully melted during atmospheric entry. 
Fifty-nine of these are I-type (iron) cosmic spherules (that is, they con- 
tain no silicate material, only FeNi metal and/or Fe oxide phases) and 
one is a glassy type (that is, devitrified silicate glass). I-type cosmic 
spherules represent meteoritic FeNi metal that melted and reacted with 
the atmosphere during deceleration, via oxidation of Fe to form wüstite
and magnetite¹⁶.

Figure 1 shows a representative selection of the recovered spherules; 
for comparison, examples of modern I-type spherules and weathered 
micrometeorites are shown in Extended Data Fig. 2. The exterior of all 
I-type spherules consists of interlocking dendritic crystals of magnetite 
Figure 1 | Examples of fossil micrometeorites recovered in this study.
a-d, Representative examples of surface features of the micrometeorites,
characterized by interlocking dendritic iron oxide crystals. e-h,
Representative examples of the internal features of the micrometeorites. 
The majority of micrometeorites that we sectioned were identical to the 


(Fe²⁺Fe³⁺₂O₄) and occasionally, the less oxidized wüstite (FeO). Some
spherules have limestone still attached, indicating that they are unlikely
to represent modern contamination. The interiors of sectioned I-type
spherules vary from being magnetite-only (9 of 11 micrometeorites) to
wüstite with FeNi metal (2 of 11 micrometeorites; Fig. 1, Extended Data
Table 1, Extended Data Fig. 3), identified via wavelength dispersive 
X-ray spectroscopy (WDS) and Raman spectroscopy. Synchrotron 
powder X-ray diffraction of two additional micrometeorites found 
only magnetite throughout the entire volume of both micrometeorites 
(Extended Data Fig. 4). 

Evidence that the recovered specimens are micrometeorites that 
were oxidized in the atmosphere, rather than products of diagenesis, 
metamorphism or later weathering includes: (1) the spheres of FeNi 
metal encased in wüstite (Fig. 1) confirm an extraterrestrial origin
(FeNi metal does not form, and wüstite is rare, in sedimentary environments
on Earth) and indicate quenching of a partially oxidized metallic
liquid; (2) the spherical morphology and high surface area of the interlocking
magnetite crystals indicate rapid crystallization from a liquid
with high surface tension; these are typical of modern micrometeorites
recovered from Antarctic ice¹⁶ and the deep ocean²¹, and inconsistent
with oxidation of iron metal encased in sediment²², where there is no
mechanism to promote retention of the spherical shape (for example, 
Extended Data Fig. 2e); and (3) oxidation of metal during modern 
weathering produces haematite, ferrihydrite and goethite, and results 
in volume expansion sufficient to destroy the delicate submicrometre 
surface textures shown in Fig. 1. 

Given their oxidized mineralogy, we designed a mathematical 
model to examine micrometeorite oxidation during atmospheric entry 
(see Methods). Molecules in the Archaean upper atmosphere capable 
of causing oxidation of Fe metal include O2 and CO2 (Extended Data
Fig. 5), whereas CO is the only moderately abundant species capable of
causing reduction; other species were likely to have been present at concentrations
too low to be relevant²³. Given that nearly all of the I-type
cosmic spherules from the Meentheena Limestone contain dominant
proportions of magnetite and/or wüstite relative to metal, the oxidizing
molecules must have dominated Earth's upper atmosphere at 2.72 Ga.
Equilibrium-based calculations imply that small micrometeorites
cannot be oxidized to liquid magnetite by a CO2-dominated atmosphere,
and that some O2 is required (Extended Data Fig. 5). But the
magnetite-only examples shown in g and h. The metal-dominated sample
in e is the only one we observed; the spheres of wüstite in metal indicate
mingling of immiscible melts. The small size of this example is consistent 
with the enhanced preservation of metal in smaller micrometeorites 
predicted by the oxidation modelling (Fig. 2). 


relative concentrations of O2 and CO2 are challenging to model,
because the short duration of atmospheric heating means that equilibrium
is never reached. Nonetheless, experiments on high temperature
iron oxidation in mixed gases show that oxidation in the presence of
O2 is rapid and readily forms magnetite + wüstite, whereas oxidation
driven by CO2 alone is slow and produces only wüstite (Extended Data
Fig. 6)²⁴,²⁵. The predominance of magnetite in the Archaean micrometeorites
thus implies that O2 was an important oxidizing species.
Furthermore, experiments indicate that when oxygen is present, addition
of CO2 has little effect on brief oxidation events²⁴. Our modelling
therefore investigates how much oxygen would need to be present to 
generate the observed micrometeorite mineralogy. 

Modelling of entry heating and oxidation of iron micrometeorites 
is complicated by competition between Fe oxidation and evaporative 
removal of the iron-oxide exterior; evaporation is greater for larger 
particles, faster entry velocities and steeper entry angles (Extended Data 
Fig. 7). Nonetheless, owing to the high temperatures, molten state and 
high surface area, reaction between metal and the atmosphere is rapid. 
Consequently, iron-oxide-free metal spherules have not been found 
amongst thousands of modern micrometeorites²⁶. There are, however,
modern examples of incomplete equilibration where a thick magnetite-wüstite
shell surrounds a metal bead¹⁶.

Figure 2 shows the results of atmospheric oxidation modelling for 
I-type micrometeorites, and demonstrates that significant survival 
of metal occurs when atmospheric oxygen abundance is lower than 
0.2 times the present atmospheric level (PAL), particularly in the case of 
smaller micrometeorites. In contrast, oxide- and particularly magnetite- 
dominated spherules are most abundant at atmospheric O2 concentra- 
tions similar to the Earth’s current atmosphere. The absence of metal 
in the majority of the sectioned Archaean spherules, coupled with 
their small size, therefore indicates that they formed by heating in an 
atmosphere with a dramatically higher proportion of oxygen than the 
estimated surface concentration of oxygen at this time (<1 × 10⁻⁵
PAL⁴). Assuming that current models of the Archaean lower atmosphere
are accurate, our results imply strong decoupling of the lower and
upper atmosphere at that time. 

In today’s atmosphere there is rapid, vigorous mixing below the 
tropopause (altitude 8-18 km), little vertical mixing within the stratosphere
(vertical diffusivity is ~0.1 m² s⁻¹; ref. 27) below the stratopause
Figure 2 | Results of the atmospheric oxidation model. Shown in colour
are the stability fields for the dominant iron-bearing phases after heating
relative to the size and entry angle of particles. Abbreviations: met,
metal; w, wüstite; mag, magnetite. a, b, Results are given for two different
entry velocities; 12 km s⁻¹ (a) is the minimum possible entry velocity,


(50-55 km), and moderate vertical mixing within the mesosphere up to 
the mesopause (85 km), above which there is minimal mixing again’. 
The extent of vertical atmospheric mixing is controlled by the thermal 
properties of the gases; because hotter air is less dense than colder air, 
mixing is inhibited where air temperature increases with altitude, as 
in the stratosphere where more ultraviolet radiation is absorbed and 
converted to heat by ozone at higher altitudes (Fig. 3). Importantly, 
although mixing occurs within the mesosphere, there is minimal 
chemical communication between the altitude of maximum I-type 
micrometeorite heating (75-90 km) and that of volcanic flux (in the 
troposphere and stratosphere, 0-50km)"’, largely due to the inverted 
temperature profile of the stratosphere. Our observations suggest that 
a similar atmospheric structure with inhibited vertical mixing existed 
at 2.72 Ga. Kasting²³ suggested that biosphere-atmosphere interactions
may have moderated the CH4:CO2 ratio to stabilize an Archaean
atmosphere with an optically thin organic haze; we suggest that if such 
Figure 3 | Profile of the modern atmosphere. Shown are the altitudes
reached by volcanic eruptions (lower grey shaded area) compared with the 
altitudes of maximum micrometeorite heating (upper grey shaded area), 
as well as the temperature profile (dashed curve) and how this influences 
vertical mixing. 
and 18 km s⁻¹ (b) is a common velocity for meteoroids that survive
atmospheric entry. The coloured stability fields (olive, pink, red) are for 
the current atmospheric oxygen abundance; dashed lines indicate the shift 
in these fields at different oxygen abundances (0.2, 0.5, 0.8, and 2.0 times 
PAL, numbers on curves). 


a haze existed it would absorb heat and create an inverted temperature 
profile (see, for example, ref. 28) thereby suppressing mixing between 
the oxygen-poor lower and oxygen-rich upper atmosphere. 

Some chemical models of the Archaean atmosphere suggest that a 
methane-bearing atmosphere could have existed up to ~50 km altitude 
(Fig. 4)²⁹, and this could represent the hypothesized transition between
poorly mixing upper and lower atmospheric domains. However, micro- 
meteorites cannot test the altitude of this transition since they are 
quench-crystallized by the time they reach ~75 km. The only work 
similar to that presented here is an analysis of the oxidation state of 
impact ejecta preserved in 3.24-Gyr-old spherule beds, which found 
considerable heterogeneity in oxidation state and that atmospheric oxy- 
gen fugacity was <10~* bar at that time*®. Although Krull-Davatzes 
et al. suggested that the oxidation state of impact ejecta is set within 
the ejection plume (see ref. 30 and references therein), and thus impact 
spherules represent samples of the lower atmosphere, it is plausible 
that some are remelted during re-entry³¹. Because impact spherules
are up to two orders of magnitude larger than micrometeorites and 
Figure 4 | A recent model of the Archaean atmosphere that includes
methane, showing the effects of CO2 photolysis on O2 and CO
concentration. The solid blue and purple lines indicate the estimated O2
and CO concentrations of the existing model²⁹, whereas the dashed blue
and purple lines (arrowed) approximate the relatively small shifts in the
positions of these lines needed to satisfy the observed micrometeorite
oxidation. Modern O2 (green line) is shown for reference; H2 and O3
concentrations are too low to affect micrometeorite chemistry. This figure 
is modified from ref. 29 with permission from John Wiley and Sons, 
license number 3816161033966. 
re-enter the atmosphere at lower velocity, those that remelted would
sample a thicker belt and lower level of the atmosphere, possibly close to 
the hypothesized oxygen transition. Detailed modelling of impact 
spherule re-entry thus represents a possible avenue for sampling the 
chemistry of middle levels of the atmosphere; in particular, larger par- 
ticles are more likely to have been remelted, so certain size fractions 
may have sampled appropriate atmospheric levels. 

Chemical models of the Archaean atmosphere are based on gas fluxes 
from modern volcanoes and the effects of ultraviolet photolysis on 
those gases, and constrained by the above observations from ancient 
rocks”. These models imply that at lower atmospheric levels, reaction 
with abundant volcano-derived H2 gas eliminated free O2, consistent
with the Δ³³S data that require an oxygen-poor lower atmosphere.
Above an altitude of ~50km, photolysis of CO2 produced O2 and CO 
(Fig. 4; ref. 29), but there is extensive debate about how much CO2 was
present in the Archaean atmosphere’, with estimates ranging from 10 
to 1,000 PAL. The more CO2 present in these models, the more O2 and
CO are produced by photolysis in the upper atmosphere. Furthermore,
these models suggest that there may have been more CO produced than 
O2; for example, the model in Fig. 4 suggests an upper atmosphere with
an O2/CO ratio of ~0.2. However, this abundant CO is inconsistent 
with the observed magnetite-rich micrometeorites because it would 
act as a reductant (for example, CO + FeO = CO2 + Fe; Extended Data
Fig. 5). At equilibrium, the point where oxidation balances reduction 
comes when there is half as much O2 as CO, so any oxidation implies 
O2/CO > 0.5. If the O2/CO ratio was only slightly greater than 0.5, the 
oxidation would have been sluggish and we would only see thin wüstite
rims on metal in the larger spherules. Complete oxidation of the very 
small micrometeorites to magnetite suggests that the Archaean upper 
atmosphere may have been characterized by O2 levels approaching
those of the present-day Earth, requiring relatively abundant CO2 as a
source of O2, and elevated ratios of O2 to CO.
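
The O2/CO threshold of 0.5 quoted above can be read as simple oxygen bookkeeping. The sketch below is an illustration, not the oxidation model used in the study: it assumes the oxidizing step is 2Fe + O2 = 2FeO (two oxygen atoms supplied per O2), pairs it with the reduction CO + FeO = CO2 + Fe given in the text (one oxygen atom removed per CO), and ignores kinetics and CO2. The function names are hypothetical.

```python
def net_oxygen_supply(o2: float, co: float) -> float:
    """Net oxygen atoms available to iron: +2 per O2, -1 per CO."""
    if o2 < 0 or co < 0:
        raise ValueError("abundances must be non-negative")
    return 2.0 * o2 - co


def is_net_oxidizing(o2_to_co_ratio: float) -> bool:
    """True when the O2/CO ratio exceeds the equilibrium balance point of 0.5."""
    if o2_to_co_ratio < 0:
        raise ValueError("ratio must be non-negative")
    return o2_to_co_ratio > 0.5


if __name__ == "__main__":
    # 0.2 is the O2/CO ratio the text attributes to the model in Fig. 4.
    for ratio in (0.2, 0.5, 0.6):
        state = "oxidizing" if is_net_oxidizing(ratio) else "not oxidizing"
        print(f"O2/CO = {ratio}: net supply {net_oxygen_supply(ratio, 1.0):+.1f}, {state}")
```

With the ratio of about 0.2 from the model in Fig. 4 the balance is negative, which is the inconsistency with the magnetite-rich spherules that the paragraph points out.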


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 30 May 2015; accepted 9 March 2016. 


1. Lyons, T. W., Reinhard, C. T. & Planavsky, N. J. The rise of oxygen in Earth’s early 
ocean and atmosphere. Nature 506, 307-315 (2014). 

2. Farquhar, J., Zerkle, A. L. & Bekker, A. in Treatise in Geochemistry Vol. 6,
The Atmosphere — History (eds Holland, H. & Turekian, K.) 91-138 
(2014). 

3. Som, S. M., Catling, D. C., Harnmeijer, J. P., Polivka, P. M. & Buick, R. Air density
2.7 billion years ago limited to less than twice modern levels by fossil raindrop 
imprints. Nature 484, 359-362 (2012). 

4. Pavlov, A. A. & Kasting, J. F. Mass-independent fractionation of sulfur isotopes 
in Archean sediments: strong evidence for an anoxic Archean atmosphere. 
Astrobiology 2, 27-41 (2002). 

5. Rasmussen, B. & Buick, R. Redox state of the Archean atmosphere: evidence 
from detrital heavy minerals in ca. 3250-2750 Ma sandstones from the 
Pilbara Craton, Australia. Geology 27, 115-118 (1999). 

6. Rye, R. & Holland, H. D. Paleosols and the evolution of atmospheric oxygen:
a critical review. Am. J. Sci. 298, 621-672 (1998). 

7. Bekker, A. et al. Iron formation: the sedimentary product of a complex interplay
among mantle, tectonic, and biospheric processes. Econ. Geol. 105, 467-508 
(2010). 

8. Buick, R. The antiquity of oxygenic photosynthesis: evidence from 
stromatolites in sulphate-deficient Archaean lakes. Science 255, 74-77 
(1992). 

9. Eigenbrode, J. L. & Freeman, K. H. Late Archean rise of aerobic microbial 
ecosystems. Proc. Natl Acad. Sci. USA 103, 15759-15764 (2006).

10. Konhauser, K. O. et al. Oceanic nickel depletion and a methanogen famine 
before the Great Oxidation Event. Nature 458, 750-753 (2009). 


238 | NATURE | VOL 533 | 12 MAY 2016 


11. Farquhar, J. & Wing, B. A. Multiple sulfur isotopes and the evolution of the 
atmosphere. Earth Planet. Sci. Lett. 213, 1-13 (2003). 

12. Cioni, R., Marianelli, P., Santacroce, R. & Sbrana, A. in Encyclopedia of Volcanoes
(ed. Sigurdsson, H.) 477-494 (Academic, 2000). 

13. Finlayson-Pitts, B. & Pitts, J. Chemistry of the Upper and Lower Atmosphere: 
Theory, Experiments, and Applications (Elsevier, 1999). 

14. Love, S. G. & Brownlee, D. E. Heating and thermal transformation of 
micrometeoroids entering the Earth's atmosphere. Icarus 89, 26-43 (1991). 

15. Rietmeijer, F. J. M. & Nuth, J. A. Collected extraterrestrial materials: 
constraints on meteor and fireball compositions. Earth Moon Planets 82-83, 
325-350 (2000). 

16. Genge, M., Engrand, C., Gounelle, M. & Taylor, S. The classification of 
micrometeorites. Meteorit. Planet. Sci. 43, 497-515 (2008). 

17. Tianrui, S., Zhengjun, H., Yusheng, W. & Yanxue, L. A study of Mesoproterozoic 
iron cosmic micro-spherules from 1.8 Ga and 1.6 Ga old strata in the 
Ming Tombs District, Beijing. Acta Geol. Sin. 81, 649-657 (2007). 

18. Awramik, S. M. & Bucheim, H. P. A giant, Late Archean lake system:
the Meentheena Member (Tumbiana Formation; Fortescue Group),
Western Australia. Precambr. Res. 174, 215-240 (2009). 

19. Blake, T. S., Buick, R., Brown, S. J. A. & Barley, M. E. Geochronology of a Late 
Archaean flood basalt province in the Pilbara Craton, Australia: constraints 
on basin evolution, volcanic and sedimentary accumulation, and continental 
drift rates. Precambr. Res. 133, 143-173 (2004). 

20. Trendall, A. F., Compston, W., Nelson, D. R., De Laeter, J. R. & Bennett, V. C. 
SHRIMP zircon ages constraining the depositional chronology of the 
Hamersley Group, Western Australia. Aust. J. Earth Sci. 51, 621-644 (2004). 

21. Rudraswami, N. G. et al. Refractory metal nuggets in different types of
cosmic spherules. Geochim. Cosmochim. Acta 131, 247-266 (2014). 

22. Chevrier, V., Rochette, P., Mathé, P.-E. & Grauby, O. Weathering of iron-rich 
phases in simulated Martian atmospheres. Geology 32, 1033-1036 (2004). 

23. Kasting, J. F. in Treatise in Geochemistry Vol. 6, The Atmosphere — History 
(eds Holland, H. & Turekian, K.) 157-175 (2014). 

24. Abuluwefa, H. T., Guthrie, R. I. L. & Ajersch, F. Oxidation of low carbon steel in 
multicomponent gases: Part I. Reaction mechanisms during isothermal 
oxidation. Metall. Mater. Trans. A 28, 1633-1641 (1997). 

25. Bredesen, R. & Kofstad, P. On the oxidation of iron in CO2 + CO mixtures. 
III: Coupled linear parabolic kinetics. Oxidat. Metals 36, 27-56 (1991). 

26. Genge, M. The origins of I-type spherules and the atmospheric entry of iron 
micrometeoroids. Meteorit. Planet. Sci. (in the press). 

27. Legras, B., Joseph, B. & Lefevre, F. Vertical diffusivity in the lower stratosphere 
from Lagrangian back-trajectory reconstructions of ozone profiles. J. Geophys. 
Res. 108, 4562, http://dx.doi.org/10.1029/2002JD003045 (2003). 

28. Fulchignoni, M. et al. In situ measurements of the physical characteristics of 
Titan’s environment. Nature 438, 785-791 (2005). 

29. Zahnle, K., Claire, M. & Catling, D. The loss of mass-independent fractionation 
in sulfur due to a Palaeoproterozoic collapse of atmospheric methane. 
Geobiology 4, 271-283 (2006). 

30. Krull-Davatzes, A. E., Byerley, G. R. & Lowe, D. R. Evidence for a low-O2 Archean 
atmosphere from nickel-rich chrome spinels in 3.24 Ga impact spherules, 
Barberton greenstone belt, South Africa. Earth Planet. Sci. Lett. 296, 319-328 
(2010). 

31. Goldin, T. J. & Melosh, H. J. Self-shielding of thermal radiation by Chicxulub 
impact ejecta: firestorm or fizzle? Geology 37, 1135-1138 (2009). 


Acknowledgements We thank N. Wilson and A. Langendam for assistance with 
electron microprobe work and electron microscopy, respectively. The authors 
acknowledge use of the Monash Centre for Electron Microscopy, and use of 
the CSIRO Microbeam Laboratory. Part of this research was undertaken on the 
powder diffraction beamline at the Australian Synchrotron, Victoria, Australia. 
M.G. acknowledges STFC grant number ST/J001260/1. 


Author Contributions A.G.T. conceptualized the project, conducted 
fieldwork and EMP analysis, and wrote the paper. L.B. conducted fieldwork, 
micrometeorite separation and SEM analysis. M.G. generated the 
micrometeorite oxidation model. S.A.W. advised on micrometeorite separation, 
conducted Raman spectroscopy and interpreted synchrotron results. H.E.A.B. 
conducted the synchrotron analysis. J.L.W. modelled the oxidizing conditions 
imposed at equilibrium by different atmospheres. All authors reviewed the 
paper prior to submission. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the 
paper. Correspondence and requests for materials should be addressed to 
A.G.T. (andrew.tomkins@monash.edu). 


© 2016 Macmillan Publishers Limited. All rights reserved 


METHODS 


Extraction and characterization. The weathered exterior of the limestone blocks 
was removed using a diamond saw. The resulting samples (3.6 kg) were then 
crushed to <1 cm fragments, then bathed in 10% HCl (samples 1.1, 1.3a) or 20% 
HCl (sample 1.2) for 1-2 days. After wet sieving the residues into 20-125, 125-355, 
355-1,000 and >1,000 µm size fractions, magnetic separation and hand picking 
under a binocular microscope were used to isolate micrometeorites. Care was 
taken at all stages to keep the samples covered, to avoid contamination by present 
day micrometeorites. The isolated grains were then confirmed or rejected as 
micrometeorites through imaging using a JEOL 7001F FEG-SEM. Micrometeorites 
were then embedded in epoxy and sectioned for further imaging and electron 
microprobe (EMP) analysis using a JEOL 8500F HyperProbe operating at 15 kV 
accelerating voltage and 20 nA beam current. Owing to their small size, only 
eleven micrometeorites were sectioned effectively. 

Raman spectroscopy was used to confirm the EMP-determined identity of 
wüstite in Fig. 1f. Raman spectra were collected using a Renishaw inVia Raman 
microscope fitted with a coherent 632.8 nm HeNe laser. The 520.5 cm⁻¹ band of 
a silicon wafer was used to calibrate the instrument before analysis of samples. 
Spectra were recorded between 2,000 and 100 cm⁻¹ with a resolution of 1-2 cm⁻¹ 
using a 50× objective lens. An initial spectrum was recorded using a power of 
~1.5 mW at the sample surface and an exposure time of 60 s for 4 accumulations. 
The resulting spectrum (Extended Data Fig. 3a) is characterized by a broad, 
asymmetrical band at 650-660 cm⁻¹, which is consistent with the spectrum of 
wüstite (ref. 32; Extended Data Fig. 3b). Although magnetite also produces a band 
in the vicinity of 660-680 cm⁻¹, the diagnostic peaks of magnetite at ~300 and 
~535 cm⁻¹ are not observed in this spectrum. A second test of the wüstite 
identification is possible because wüstite decomposes rapidly to haematite when 
exposed to 632.8 nm radiation at ~7.0 mW, whereas magnetite is stable at these 
conditions (ref. 32). Thus a second spectrum was recorded using ~7.5 mW at the 
sample surface, and a single accumulation with an exposure time of 50 s. In this 
acquisition, the targeted iron oxide phase immediately decomposed to produce 
the characteristic spectrum of haematite (Extended Data Fig. 3a), confirming 
the identity of wüstite. 
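The two-step Raman test described above can be summarized as a small decision rule. The sketch below is illustrative only: the function name and the tolerance windows around the ~300 and ~535 cm⁻¹ magnetite peaks are assumptions, not values from the paper.

```python
def classify_iron_oxide(peaks_cm1, decomposes_at_high_power):
    """Apply the two Raman criteria from the text to one measured grain.

    peaks_cm1: band positions (cm^-1) seen in the low-power spectrum.
    decomposes_at_high_power: True if the phase turned to haematite
    when re-measured at higher laser power.
    The peak windows below are assumed tolerances, not published values.
    """
    def has_peak(low, high):
        return any(low <= p <= high for p in peaks_cm1)

    main_band = has_peak(650, 680)  # band shared by both oxides
    magnetite_markers = has_peak(290, 310) or has_peak(525, 545)

    if not main_band:
        return "undetermined"
    if not magnetite_markers and decomposes_at_high_power:
        return "wustite"
    if magnetite_markers and not decomposes_at_high_power:
        return "magnetite"
    return "undetermined"  # the two tests disagree


print(classify_iron_oxide([655], True))
print(classify_iron_oxide([300, 535, 670], False))
```

Requiring both tests to agree before naming a phase mirrors the text, which treats the decomposition experiment as an independent check on the peak-based identification.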

An additional six whole micrometeorites were mounted on Hampton Scientific 
CryoLoops to investigate whether multiple phases could be identified within whole 
micrometeorite samples via powder X-ray diffraction (XRD). Data were collected 
on these using the Powder Diffraction beamline at the Australian Synchrotron. 
The penetration depth of the X-ray beam employed (15.0 keV) is on the order 
of 50 µm in magnetite, which allows for non-destructive sampling of the entire 
volume of individual micrometeorites. Data were collected with the MYTHEN-II 
strip detector. Mineral phases were identified with reference to the ICDD PDF-2 
database using the EVA V.1 software package available from Bruker AXS. Owing to 
their small size, reliable XRD data could only be obtained for two micrometeorites; 
results are shown in Extended Data Fig. 4. 

To evaluate the oxidizing potential of different atmospheres, equilibrium gas 
speciation (and hence oxygen fugacity, fO2) was determined by Gibbs free energy 
minimization calculations using the HSC Chemistry (v. 6.1) software package. 
Modelling was undertaken between 1,250 and 2,500°C, and 1 to 1 x 10-° bar 
pressure, assuming ideal gas mixing. For the CO2-CO and CO2 gas mixtures, species 
were limited to O2, CO and CO2, and N2 is considered to have no oxidizing 
or reducing capacity. Air species included in the modelling were CO, CO2, H, H2, 
HNO, HNO2, HNO3, HO2, H2O, N, N2, NO, NO2, NO3, N2O, O, O2, O3, OH, Ar. 
Results for the range of dynamic ram pressures experienced by micrometeorites 
are shown in Extended Data Fig. 5. 
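For the simplest CO2-CO case, the link between gas composition and oxygen fugacity can be illustrated with a single reaction, CO + ½O2 = CO2, under ideal-gas mixing. This is a simplified stand-in for the full Gibbs free energy minimization performed in HSC Chemistry, not a reproduction of it. The function name is hypothetical, and the equilibrium constant K must be supplied from thermodynamic tables for the temperature of interest; no value is given in the text.

```python
import math


def log10_fo2_from_co2_co(co2_to_co_ratio, equilibrium_constant):
    """log10 of fO2 (bar) for CO + 1/2 O2 = CO2 with ideal-gas mixing.

    K = p_CO2 / (p_CO * fO2**0.5)  =>  fO2 = (ratio / K)**2.
    equilibrium_constant is a user-supplied K at the chosen temperature
    (an input assumption; it is not tabulated in the source text).
    """
    if co2_to_co_ratio <= 0 or equilibrium_constant <= 0:
        raise ValueError("ratio and K must be positive")
    return 2.0 * (math.log10(co2_to_co_ratio) - math.log10(equilibrium_constant))


# Purely illustrative numbers: a tenfold rise in CO2/CO at fixed K
# raises fO2 by two orders of magnitude.
print(log10_fo2_from_co2_co(10.0, 1.0e3))
print(log10_fo2_from_co2_co(100.0, 1.0e3))
```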

Modelling. The model used to calculate the extent of oxidation of iron metal during 
atmospheric entry is based on the model of ref. 14 and numerically integrates the 
partial differential equations for motion, heat and mass of particles decelerating in 
the atmosphere. Evaporation of particles is modelled using the Langmuir equation 
with coefficients for iron metal and iron oxide melts derived from experimental 
studies**. Evaporative mass loss is used in the model to calculate the change in 
mass and radius of the particle, which directly influences its deceleration. The 
average density of the particles is assumed to be that of wiistite, given that most of 
the observed particles are dominated by iron oxides and that the density of wiistite 
is between that of magnetite and iron. The latent heat of evaporation is considered 
in evaluating the temperature of the particle, together with heat loss by thermal 
radiation and energy input due to incident atmospheric molecules. As in Love and 
Brownlee (ref. 14), the trajectory of the particle, derived from the equations of motion, 
is used to trace change in altitude at each time step and thus atmospheric density. 
Time steps of 0.01 s were used for these simulations; results are independent of time 
step size to within 1% for the peak temperature of particles. The degree of oxidation 
is calculated from the mass of oxygen that particles encounter during deceleration, 
moderated by loss of oxygen by evaporation of iron oxide liquid. This calculation 
recognizes that the amount of oxygen accreted by iron micrometeorites can be 
no more than the mass of oxygen the particles encounter during deceleration. It 
also uses the observation that iron oxide liquid mantles the iron metal core of the 
particle due to surface tension effects resulting in evaporation of oxide rather than 
metal. Accretion of oxygen is also assumed to only occur when particles are molten 
since diffusion rates within solid metal are too slow to allow noticeable oxidation 
over the few seconds of deceleration. 
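The structure of such a calculation can be sketched in a few lines. The code below is not the authors' model (which is not publicly available); it is a deliberately reduced illustration with several labelled assumptions: an exponential atmosphere instead of the 1976 US Standard Atmosphere, a constant particle radius (no evaporation), no gravity, a straight-line trajectory, unit drag and heat-transfer coefficients, and a purely radiative energy balance. All function and parameter names are hypothetical, as are the default density (roughly that of wüstite) and melting threshold (the melting point of iron).

```python
import math

SIGMA = 5.670374419e-8  # Stefan-Boltzmann constant, W m^-2 K^-4


def air_density(alt_m, rho0=1.225, scale_height_m=7000.0):
    """Exponential atmosphere in kg m^-3 (assumed stand-in profile)."""
    return rho0 * math.exp(-alt_m / scale_height_m)


def simulate_entry(radius_m, speed_ms, angle_from_vertical_deg, o2_mass_fraction,
                   particle_density=5700.0, melt_temp_k=1811.0, emissivity=1.0,
                   dt=0.01, start_alt_m=190e3, stop_speed_ms=500.0):
    """Explicit-Euler entry of a non-evaporating sphere (simplified sketch)."""
    if not 0.0 <= angle_from_vertical_deg < 90.0:
        raise ValueError("angle must be in [0, 90) degrees from vertical")
    cos_theta = math.cos(math.radians(angle_from_vertical_deg))
    area = math.pi * radius_m ** 2
    v, h = speed_ms, start_alt_m
    peak_temp, peak_alt, o2_mass = 0.0, h, 0.0
    while h > 0.0 and v > stop_speed_ms:
        rho = air_density(h)
        # Radiative balance: pi r^2 rho v^3 / 2 = 4 pi r^2 eps sigma T^4
        temp = (rho * v ** 3 / (8.0 * emissivity * SIGMA)) ** 0.25
        if temp > peak_temp:
            peak_temp, peak_alt = temp, h
        if temp >= melt_temp_k:
            # Oxygen swept up counts only while the particle is molten
            o2_mass += o2_mass_fraction * rho * area * v * dt
        # Drag deceleration: dv/dt = -3 rho v^2 / (4 rho_p r)
        v -= 3.0 * rho * v ** 2 / (4.0 * particle_density * radius_m) * dt
        h -= v * cos_theta * dt
    return {"peak_temp_k": peak_temp, "peak_alt_m": peak_alt, "o2_mass_kg": o2_mass}


result = simulate_entry(50e-6, 12000.0, 0.0, 0.23)
print(result)
```

The accumulated `o2_mass_kg` is an upper bound on accreted oxygen in the sense used in the text: a particle cannot take up more oxygen than it encounters while molten. Scaling `o2_mass_fraction` is the analogue of varying the oxygen abundance relative to the modern atmosphere.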

Given the dependence of the extent of oxidation on both particle temperature 
and mass of oxygen encountered, the atmospheric density-altitude profile is 
important. In the absence of a consensus density model of the Archaean atmos- 
phere, the 1976 US Standard Atmosphere is used in the simulations. Some models 
of the Archaean atmosphere, however, suggest that density variation with altitude 
was similar to the modern day’. Changes in the exact atmospheric density with 
altitude, however, can be seen to change the mass of gas encountered over molten 
flight relatively little. A denser Archaean atmosphere results in a larger mass of 
gas encountered per second, but particle deceleration would also occur more 
rapidly, giving a shorter period for accretion of oxygen. Conversely a less dense 
Archaean atmosphere results in a smaller mass of gas encountered per second but 
less deceleration, resulting in a longer period over which particles accrete oxygen. 
The results of 12,000 simulations with and without oxidation suggest that iron 
micrometeoroids undergo maximum heating at altitudes equivalent to 65-90 km; 
that is, within the current day mesosphere. 
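The compensation between gas density and deceleration time can be made explicit with a short idealized derivation. It assumes a constant particle radius r and density ρ_p, free-molecular drag with a unit drag coefficient, and no gravity; these simplifications are introduced here for illustration and are not the full model used in the simulations.

```latex
\[
\frac{dv}{dt} = -\frac{3\,\rho_a v^{2}}{4\,\rho_p r}
\quad\Longrightarrow\quad
\frac{dv}{v} = -\frac{3}{4\,\rho_p r}\,\rho_a v\,dt .
\]
Integrating from speed $v_0$ to $v_1$ gives the gas column swept per unit
cross-sectional area,
\[
\Sigma \equiv \int \rho_a v\,dt = \frac{4\,\rho_p r}{3}\,\ln\frac{v_0}{v_1},
\qquad
m_{\mathrm{gas}} = \pi r^{2}\,\Sigma
= \tfrac{4}{3}\pi r^{3}\rho_p\,\ln\frac{v_0}{v_1}
= m_p \ln\frac{v_0}{v_1}.
\]
```

Under these assumptions the mass of gas encountered between two speeds depends only on the particle mass and the speed ratio, not on the density-altitude profile ρ_a(h). Evaporation and the temperature-dependent start and end of molten flight break this exact cancellation, which is consistent with the dependence being small rather than zero.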

The abundance of oxygen in the Archaean atmosphere in the simulations was 
varied by scaling the modern day oxygen abundance with altitude. The effects 
of other gas species on oxidation or reduction were not considered within the 
model. Carbon monoxide, if present within the Archaean atmosphere, would 
oppose oxidation by driving reduction of iron oxide melt and production of CO2. 
At equilibrium, if the O2/CO ratio was <0.5, the metal would not be oxidized (that 
is, 2Fe + O2 = 2FeO versus FeO + CO = Fe + CO2). The abundances of oxygen used 
within the model can therefore be considered to be those in excess of reducing 
species in the atmosphere, although in a disequilibrium scenario of micrometeorite 
entry the rate of oxidation by O2 may be faster than the rate of reduction by CO. 
The important criteria within the oxidation model are the abundance of oxygen 
compared to that in the present atmosphere, the atmospheric density profile and 
the molten flight time of the particle, and the entry parameters that determine peak 
temperature and altitude of peak temperature. The objective of the simulations is 
not to provide an accurate and realistic model of Archaean atmospheric chemistry, 
but simply to estimate the abundance of free oxygen available to produce net 
oxidation, which is required to explain the presence of magnetite-rich Archaean 
I-type spherules. Even if the mass of oxygen accreted by particles predicted by the 
simulations is subject to an error of an order of magnitude, the results would still 
necessitate abundant oxygen within the Archaean upper atmosphere relative to 
the model shown in Fig. 4. 
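The O2/CO threshold of 0.5 follows directly from the stoichiometry of the two reactions quoted above: one O2 produces two FeO, and each CO can reduce one FeO back to metal. A minimal sketch of the resulting "oxygen in excess of reducing species" bookkeeping, using hypothetical function names and equilibrium stoichiometry only (no kinetics), is:

```python
def excess_o2(o2_amount, co_amount):
    """O2 remaining after CO has undone as much oxidation as it can.

    2Fe + O2 = 2FeO    (1 O2 makes 2 FeO)
    FeO + CO = Fe + CO2 (1 CO removes 1 FeO)
    so 1 CO cancels the effect of 0.5 O2. Amounts are in the same
    molar units; reaction rates are ignored (equilibrium assumption).
    """
    if o2_amount < 0 or co_amount < 0:
        raise ValueError("amounts must be non-negative")
    return max(0.0, o2_amount - 0.5 * co_amount)


def is_net_oxidizing(o2_amount, co_amount):
    """True when the O2/CO ratio exceeds the 0.5 threshold."""
    return excess_o2(o2_amount, co_amount) > 0.0


print(excess_o2(1.0, 1.0))         # half of the O2 is left over
print(is_net_oxidizing(0.4, 1.0))  # ratio below 0.5: metal not oxidized
```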

Code availability. The custom entry heating model code used in this paper was 
developed by M.G. and is not currently publicly available. For further details on 
the code see ref. 26. 

Sample size. No statistical methods were used to predetermine sample size. 
some iron oxides and oxyhydroxides. J. Raman Spectrosc. 28, 873-878 
Extended Data Figure 1 | Geological map showing the context of 
the sampling location (red star), and examples of the samples used 
in the study. The samples in photos a and b show examples of the fine 
laminations that occur in some layers of this unit. In a, buff coloured zones 
along cracks highlight examples of modern day oxidative weathering, 
whereas the grey colouration of the remainder of the sample indicates that 
it was not weathered, allowing survival of the micrometeorites. Weathered 
rock was removed using a diamond saw before micrometeorite separation. 
The geological map (top) is provided courtesy of the Geological Survey of 
Western Australia, Department of Mines and Petroleum. Copyright State 
of Western Australia, 2016. 

Extended Data Figure 2 | Examples of modern iron-type 
micrometeorites collected from the Antarctic ice sheet. a, b, Secondary 
electron images showing the typical exterior morphology. c, d, Back-scattered 
electron (BSE) images of polished cross-sections of two other 
micrometeorites, highlighting the interior mineralogical variation. 
e, BSE image showing an example of a partially weathered modern 
micrometeorite in cross-section where the metal (outlined by the dashed 
yellow line) has been replaced by iron oxides and iron oxyhydroxides after 
arriving on the surface; expansion has led to cracking in the surrounding 
wüstite and magnetite, which would destroy the micrometeorite if 
continued. Note that the wüstite and magnetite are unaffected by the 
weathering. All imaging by M.G. 

Extended Data Figure 3 | Results of laser Raman spectroscopy 
confirming the identity of wüstite. Wüstite is metastable below 570°C 
and decomposes to haematite when higher laser power is used, whereas 
magnetite does not decompose in this fashion (ref. 32). The iron oxide in the 
sectioned micrometeorite shown in Fig. 1f gave the spectrum shown in 
a at 1.5 mW and decomposed to produce the spectrum shown in a at 
7.5 mW laser power, consistent with the characteristics of wüstite. Panel 
b shows the characteristic spectra of magnetite and wüstite from ref. 32; 
note the arrowed bumps that characterize magnetite, which are missing 
in wüstite, and the broadened main peak of wüstite. Panel b is modified 
from ref. 32 with permission from John Wiley and Sons, license number 
3850481150510. 

are detected, and deviation from the expected relative intensities of peaks 

Extended Data Figure 5 | Equilibrium modelling of the oxidizing 
conditions imposed by different atmospheres relative to the stability 
fields of haematite, magnetite and wüstite. These models (oxygen 
fugacity versus temperature) represent the stability fields at equilibrium 
and do not consider the time needed to attain equilibrium (refer to 
Extended Data Fig. 6). In a, the top of each coloured band represents the 
conditions imposed by the maximum dynamic ram pressure (0.02 bar) 
experienced by micrometeorites, which applies to the largest and fastest 
micrometeorites; the bottom of each band is more relevant to the small 
micrometeorites observed in this study (0.001 bar). In b, the model for 
an atmosphere containing 90% N2 and 10% CO2 is shown, with contours 
for pressure to allow a comparison with atmospheres of different CO2 
abundance. 

Extended Data Figure 6 | Results of experiments on oxidation of low 
carbon steel in different gas mixes. These are the results of experiments 
conducted at 1,100°C and atmospheric pressure (1 bar); the gas mix is 
shown on the curves, the remaining gas being N2. Oxidation was measured 
by progressive weight gain over time. The conditions of survivable 
micrometeorite entry are 400-2,800 °C and dynamic ram pressure of 
the order of 0.001-0.02 bar for <2 s. Higher temperatures result in more 
effective oxidation, lower pressures result in less effective oxidation. This 
figure was compiled by amalgamating figures 1 and 8 from ref. 24, with 
permission from Springer, license number 3850591244373. 

Extended Data Figure 7 | Results of atmospheric entry modelling. 
a-c, Plots of entry angle in degrees versus initial radius of micrometeorite 
in µm. a, Peak temperatures reached by I-type spherules (numbers on 
curves in °C) entering the atmosphere at 12 km s⁻¹. Note that heating is 
greater for larger particles undergoing vertical atmospheric entry. The 
high density of metal micrometeoroids leads to higher peak temperatures 
than silicate-dominated particles. b, Altitude at which peak temperature 
is reached for I-type spherules (numbers on curves, in km) with an entry 
velocity of 12 km s⁻¹. Particles with higher entry velocity have similar 
peak altitudes since mass loss through evaporation leads to increased 
deceleration. c, The final radii of I-type particles (numbers on curves, 
in µm) after deceleration at entry velocities of 12 km s⁻¹ (black) and 
18 km s⁻¹ (grey). Greater mass loss occurs at higher entry velocities, 
entry angles and particle sizes. Mass loss occurs by surface evaporation of 
the exterior oxide melt. 




Extended Data Table 1 | Representative analyses of micrometeorite magnetite, wüstite and metal 

Sample ID   1.1(2)     1.1(2)     1.1(2)     1.1(2)    1.1(2)    1.3(a)*
Fe (at.%)   43.46      43.82      42.74      50.07     50.35     90.16
Ni (at.%)   n.a.       n.a.       n.a.       n.a.      n.a.      6.41
O (at.%)    56.54      56.18      57.26      49.93     49.65     3.43
Formula     Fe3O4      Fe3O4      Fe3O4      FeO       FeO       FeNi + FeO
Mineral     magnetite  magnetite  magnetite  wüstite   wüstite   metal + wüstite†

n.a., not analysed. 
*This analysis comes from the metal particle rimmed by wüstite in Fig. 1e. 
†Owing to the small size of the metal particles, analyses of these invariably included some surrounding wüstite in the total. 
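The mineral assignments in the table can be cross-checked from the atomic percentages alone, by comparing each measured Fe:O ratio with the ideal ratios of the common iron oxides. The sketch below is a hypothetical helper for that check, not part of the authors' workflow; it uses the first five columns of the table.

```python
# Ideal Fe:O atomic ratios of the stoichiometric oxides
IDEAL_FE_TO_O = {"FeO": 1.0, "Fe3O4": 3.0 / 4.0, "Fe2O3": 2.0 / 3.0}


def nearest_oxide(fe_at_pct, o_at_pct):
    """Return (formula, measured Fe:O ratio) for the closest ideal oxide."""
    ratio = fe_at_pct / o_at_pct
    formula = min(IDEAL_FE_TO_O, key=lambda k: abs(IDEAL_FE_TO_O[k] - ratio))
    return formula, ratio


# (Fe at.%, O at.%) pairs from Extended Data Table 1, sample 1.1(2)
analyses = [(43.46, 56.54), (43.82, 56.18), (42.74, 57.26),
            (50.07, 49.93), (50.35, 49.65)]

for fe, o in analyses:
    formula, ratio = nearest_oxide(fe, o)
    print(f"Fe {fe:5.2f}  O {o:5.2f}  Fe:O = {ratio:.3f}  ->  {formula}")
```

The nearest-ratio rule is only a sanity check: it ignores minor elements and the non-stoichiometry that real wüstite commonly shows.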
A rapid burst in hotspot motion through the 
interaction of tectonics and deep mantle flow 


Rakib Hassan¹, R. Dietmar Müller¹, Michael Gurnis², Simon E. Williams¹ & Nicolas Flament¹ 


Volcanic hotspot tracks featuring linear progressions in the age 
of volcanism are typical surface expressions of plate tectonic 
movement on top of narrow plumes of hot material within 
Earth’s mantle!. Seismic imaging reveals that these plumes 
can be of deep origin*—probably rooted on thermochemical 
structures in the lower mantle**. Although palaeomagnetic and 
radiometric age data suggest that mantle flow can advect plume 
conduits laterally”®, the flow dynamics underlying the formation 
of the sharp bend occurring only in the Hawaiian-Emperor 
hotspot track in the Pacific Ocean remains enigmatic. Here we 
present palaeogeographically constrained numerical models 
of thermochemical convection and demonstrate that flow in 
the deep lower mantle under the north Pacific was anomalously 
vigorous between 100 million years ago and 50 million years 
ago as a consequence of long-lasting subduction systems, unlike 
those in the south Pacific. These models show a sharp bend in the 
Hawaiian-Emperor hotspot track arising from the interplay of 
plume tilt and the lateral advection of plume sources. The different 
trajectories of the Hawaiian and Louisville hotspot tracks arise 
from asymmetric deformation of thermochemical structures under 
the Pacific between 100 million years ago and 50 million years ago. 
This asymmetric deformation waned just before the Hawaiian- 
Emperor bend developed, owing to flow in the deepest lower mantle 
associated with slab descent in the north and south Pacific. 

The sharp bend in the Hawaiian-Emperor hotspot track in the 
north Pacific has been interpreted as the result of a sudden change 
in Pacific plate motion over a fixed plume. In contrast, the Louisville 
hotspot track in the south Pacific features a more gradual arc through 
the period of the bend’. Palaeomagnetic and radiometric age data 
suggest that the Hawaiian plume underwent a phase of rapid 11°-15° 
southward motion between 81 million years (Myr) ago and 47 Myr ago’, 
while the Louisville plume remained within 5°-7° of present-day lati- 
tude between 70 Myr ago and 50 Myr ago’, suggesting that the plumes 
are moving independently®. Seismic imaging reveals that these are 
robust plumes deeply rooted in the mantle” and deep plumes are likely 
to originate from thermochemical structures*-°. Moreover, the spa- 
tial correlation of hotspot locations with the edges of large low-shear- 
velocity provinces (LLSVPs)° suggests that these plumes are likely to 
be rooted on thermochemical ridges*'”. Numerical models of thermo- 
chemical structures suggest that their edges should be mobile!!, in 
contrast to the longer-term stability implied by the correlation of the 
edges of LLSVPs and reconstructed volcanic features!?"*. 

A combination of plume motions derived from backward advection 
of mantle flow with a model of plate motions that explicitly incorpo- 
rated a major change in relative plate motions occurring 52-43 Myr 
ago has been proposed to reproduce the observed Hawaiian—Emperor 
track'®. These earlier flow models result in a substantial misfit of the 
predicted trail before 65 Myr ago!®, and predict a gradual slowdown 
of the Hawaiian plume motion, compared to the abrupt slowdown 
occurring around the time of formation of the Hawaiian—Emperor 


bend, as indicated by palaeomagnetic data. Moreover, Pacific—Farallon 
relative plate motion changes contemporaneous with the Hawaiian- 
Emperor bend were insignificant and gradual”"®, suggesting that the 
bend is primarily a consequence of rapid southward migration of the 
Hawaiian plume” !7-18, 

A capture-release mechanism has also been proposed”, in which 
the Hawaiian plume may have been captured by the fast-spreading 
Kula-Pacific ridge system, situated to the north of the Detroit 
seamount during its construction. In this scenario, vigorous shal- 
low mantle flow induced a strong plume tilt starting at mid-mantle 
depth, before cessation of spreading between 56 Myr ago and 47 Myr 
ago resulted in a change in mantle flow regime. Consequently, the 
plume conduit rapidly returned to its original position, resulting in the 
Hawaiian—-Emperor bend!’. However, an assumption implicit in this 
scenario is that the base of the plume remains fixed, relative to lower 
mantle structures, during the change in flow regime. 

Instead, here we advance the hypothesis that between about 100 Myr 
ago and 50 Myr ago deep mantle flow beneath the north Pacific was 
anomalously vigorous, a consequence of robust and long-lasting sub- 
duction systems, unlike those in the south Pacific. This asymmetric 
flow in the deepest lower mantle under the north and south Pacific 
explains the rapid southward motion of the Hawaiian plume and the 
contrasting sluggish latitudinal motion of the Louisville plume, result- 
ing in sharp differences in the geometry of hotspot tracks. 

We follow the trajectory and tilt of plumes over the past 230 Myr 
with palaeogeographically constrained, spherical numerical models 
of convection!®!°. We impose kinematic boundary conditions, con- 
sistent with plate motions, and assimilate thermal models of shallow 
slabs at convergent plate margins (Methods). Descending slab mate- 
rial deforms a basal layer of anomalous density”**! (Methods), cov- 
ering the core-mantle boundary (CMB), where the slab descent rates 
observed in our models”” are consistent with estimates inferred from 
global mantle tomography~’. Viscosity in the deepest lower mantle, 
below the viscosity peak??4, is low, owing to elevated temperatures 
in the thermal boundary layer above the CMB?!”®, and thus flow 
velocities there are expected to be higher than those in the ambient 
lower mantle. This is a consistent feature of our models, in agreement 
with observations from earlier flow models”®. Consequently, within 
the first 50 Myr of model time, the dense material above the CMB 
deforms, ultimately resembling the seismologically observed LLSVPs 
in the lower mantle (Methods, Extended Data Fig. 1). We refer to 
these dense thermochemical structures above the CMB in our mod- 
els as LLSVPs. Fully dynamic plumes nucleate primarily around the 
LLSVPs and plume eruption locations are highly correlated to 
the reconstructed eruption locations of large igneous provinces'®. The 
models are characterized roughly by as many plumes at the present day 
as can be inferred to be of deep origin from tomographic studies’. 

Subduction history has been suggested to have a first-order role in 
shaping the geometry of LLSVPs!""!*8. Here we compute poloidal 
flow, associated with the buoyancy resulting from subduction, in the 
Figure 1 | Time evolution of model plume trajectories in a model of 
thermochemical convection. See Extended Data Table 2, case M3. a, The 
magnitude of the time-averaged mean poloidal flow (see text) is shown by 
grey shading and the corresponding flow directions are shown by black 
arrows. Subduction zones over the past 140 Myr are plotted at 20-Myr 
intervals, coloured by age. Red triangles indicate the present-day location 
of observed plumes of deep origin at the surface. The corresponding 
model plume trajectories for Hawaii (Hm), Louisville (Lm), Easter (Em), 
Kerguelen (Km) and Tristan (Tm), at a depth of 350 km, are coloured by age. 
These trajectories generally reflect the mean poloidal flow. Model plumes 


deep lower mantle to evaluate the temporal evolution of the edges 
of LLSVPs. We compute mean poloidal flow within a 300-km-thick 
shell above the CMB and derive a time average of this evolving mean 
poloidal flow over the past 140 Myr of modelled time (Methods), 
highlighting regions that undergo strong coherent poloidal flow rel- 
ative to regions where it is comparatively weak and diffuse. Over the 
past 140 Myr, subduction zones bounding the north Pacific migrate 
oceanwards (Fig. 1a), so the deep mantle here is dominated by a strong 
coherent flow. In contrast, flow in the south Pacific is diffuse and the 
darker region under the central Pacific marks the mean areal extent 
of the Pacific LLSVP, where mean poloidal flow is weakest (Fig. 1a). 

The southward trajectory of the Hawaiian plume is nearly per- 
pendicular to the strike of the northern edge of the Pacific LLSVP, 
marked by zones of quasi-stagnation (Fig. 1a). Southeastward flow 
beneath the northwest Pacific converges with southwestward 
flow from the northeast Pacific, resulting in a strong net southward flow 
that reaches a maximum along a corridor coincident with the location 
of the present-day Hawaiian plume. The edge of the Pacific LLSVP 
recedes southward along this corridor by more than 15° over the past 
100 Myr (Extended Data Fig. 2 and Supplementary Video 1), making 
it a unique geodynamical setting. The pulse of southward poloidal flow 
in the north Pacific during the period 100-50 Myr ago is counterbalanced 
by the northeasterly poloidal flow under the southwest Pacific 
that strengthens after about 85 Myr ago (Extended Data Fig. 2 and 
Supplementary Video 1), resulting in the transition from a phase of 
rapid southward motion of the model Hawaiian plume, Hm, to a much 
slower regime at about 50 Myr ago (Fig. 1b, Extended Data Fig. 3 and 
Supplementary Video 2). 

A southward velocity of around 3 cm yr⁻¹ in the deep lower mantle 
at 90 Myr ago (Fig. 2a, bottom row) translates to over 5 cm yr⁻¹ 
in the Indo-Atlantic are shorter-lived and less mobile than those in the 
Pacific. See Fig. 2 for time-dependent profiles along PP’ and QQ’. b, The 
latitudinal deviation, 6, of model plumes from their present-day locations 
at a depth of 350 km. The coloured shading shows the latitudinal offset 
between the locations of plume conduits at depths of 350 km and 1,500 km. 
Between 81 Myr ago and 47 Myr ago (shaded grey rectangle), Hm rapidly 
moves southward by about 10°. c, Model plume tilt, as outlined in the 
Methods. Between about 81 Myr ago and 47 Myr ago, tilt accumulated by 
Hm over an earlier period of coherent southward motion decreases sharply. 

near the surface, consistent with palaeomagnetic estimates’. Over the 
same period, the edge of the LLSVP in the south Pacific—marked by 
the trough in velocity (Fig. 2a—d, bottom row)—remains compara- 
tively stable. The Louisville plume is thus isolated from the kind of 
rapid burst and slowdown of motion that caused the sharp Hawaiian- 
Emperor bend. 

A dynamical explanation for the sharp bend in the Hawaiian- 
Emperor track requires an abrupt slowdown of the plume at the time 
of the bend’s formation’* that cannot be accounted for by deep low- 
er-mantle flow alone. Ascent speed within plume conduits is low in 
the vicinity of the viscosity peak in the lower mantle and is comparable 
to rates of horizontal advection, whereas in the shallow lower mantle 
and above, the rate of ascent dominates the motion of a plume conduit. 

Consequently, the lateral advection of plume sources rooted on 
chemical ridges in the deep lower mantle is the primary factor con- 
trolling plume tilt. Between about 81 Myr ago and 47 Myr ago, the sur- 
face location of Hm moves southward by about 10°, followed by a sharp 
slowdown, whereas during this period, the base of the plume moves 
only about 6° (Fig. 1b). The remaining ~4° of motion near the surface 
arises from the slowdown of southward motion of the base of Hm. 
Consequently, the latitudinally tilted plume rapidly straightens, 
resulting in an abrupt slowdown in hotspot motion near the surface 
at about 47 Myr ago (Fig. 1c, Extended Data Fig. 3 and Supplementary 
Video 2). Predicted hotspot tracks are nearly identical to the observed 
Hawaiian-Emperor track (Fig. 3a) and the latitudinal motion of Hm 
corresponds to the much faster southward motion of the Hawaiian 
plume before the Hawaiian-Emperor bend occurs (Fig. 3b), consistent 
with palaeomagnetic data’. 
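The bookkeeping in this paragraph is simple arithmetic: about 10° of surface motion equals about 6° of base motion plus about 4° gained as the tilted conduit straightens. A short sketch of that split, together with a conversion of the surface displacement into a mean speed, is given below. The helper name and the use of a mean Earth radius of 6,371 km are assumptions made for illustration; the result is an average over the whole 81-47 Myr interval, not an instantaneous model velocity.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius (assumed for the conversion)


def mean_latitudinal_rate_cm_per_yr(delta_lat_deg, duration_myr,
                                    radius_km=EARTH_RADIUS_KM):
    """Average speed for a latitudinal shift over a time span, at a given radius."""
    arc_km = radius_km * math.radians(delta_lat_deg)
    return arc_km / duration_myr * 0.1  # 1 km/Myr = 0.1 cm/yr


surface_shift_deg = 10.0  # motion of Hm near the surface, from the text
base_shift_deg = 6.0      # motion of the plume base, from the text
tilt_contribution_deg = surface_shift_deg - base_shift_deg

print(tilt_contribution_deg)
print(round(mean_latitudinal_rate_cm_per_yr(surface_shift_deg, 81.0 - 47.0), 2))
```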

Numerical experiments computed over a range of uncertain 
parameters (Methods, Extended Data Tables 1 and 2) demonstrate a 
Figure 2 | Longitudinal cross-sections. Cross-sections through Hm 
(see Fig. 1a) show its southward motion and the evolution of tilt that 
arises from the offset of the base of the plume relative to conduit location 
near the surface (see Fig. 1b and c). Ascent speeds within model plume 
conduits wane towards the present-day values as plumes mature. This 
is consistent with large volumes of lava being emplaced on Earth's surface 
when a plume-head erupts, followed by weaker magmatic activity associated 
with a waning plume-tail. a, Temperature profile along a constant 
meridian, through Hm, at age 90 Myr, for latitudinal extent shown. 


consistent pattern of strong time-averaged poloidal flow in the north 
Pacific compared to that in the south (Extended Data Fig. 4). Rapid 
southward migration of Hm is observed when the density contrast of 
Figure 3 | Predicted hotspot tracks. a, The thick multicoloured trajectory 
represents the motion of the Hawaiian plume, based on the motion of 
Hm (see Fig. 1). The thin multicoloured trajectory represents the same 
from a similar model (see Extended Data Table 2, case M4), where the 
plate reconstruction is based on a reference frame constrained by 
lower-mantle slab remnants (Methods). The solid and dashed black lines 
represent predicted surface hotspot tracks, computed from the thick 
and thin multicoloured trajectories of Hm, respectively (Methods). 
White diamonds along these hotspot tracks mark 10-Myr intervals. 
The background shows gravity anomaly to reveal the Hawaiian-Emperor 
seamount chain. b, The solid and dashed black lines show palaeolatitudes 
corresponding to the thick and thin multicoloured trajectories in 
a, respectively. Red circles are inferred palaeolatitudes for the Emperor 
seamounts and vertical error bars represent 95% confidence limits’. 
See Tarduno et al.’ for a description of horizontal error bars and lists of 
data sources. 

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The temperature field away from radial layer averages in the panel is 
contoured in red by the depth-averaged standard deviation of radial layer 
temperature values, thus outlining the plume conduit. The thick black 
contour marks 75% anomalous chemical concentration. Velocity vectors 
outside the plume conduit are shown in lighter shading. The bottom panel 
shows the magnitude of mean poloidal flow in a 300-km-thick shell 

above the CMB, at age 90 Myr, along profiles PP’ and QQ’ (see Fig. 1a). 
b-d, As for a, but for ages as given. 


the LLSVP material ranges between 2% and 2.5%; for larger density 
contrasts (Extended Data Fig. 4f) and when the LLSVP material is 
intrinsically more viscous (Extended Data Fig. 4e), the areal extent of 
the Pacific LLSVP is greater and the plume motion is smaller. Plume 
mobility could thus be a criterion to constrain the density contrast of 
LLSVP material. 

We demonstrate that for a range of reasonable parameter values, 
the Pacific LLSVP deforms asymmetrically between 100 Myr ago and 
50 Myr ago and that predicted hotspot tracks explain the formation of 
the sharp Hawaiian-Emperor bend, without requiring a major change 
in plate motion. We note that our present models do not capture the 
potential influences of a waning ridge system in the vicinity of a plume 
and that more sophisticated convection models—featuring self- 
consistent ridges and transform boundaries—are necessary to test the 
dynamic plausibility of a capture-release mechanism’” for plumes 
rooted on mobile lower mantle structures. Our results imply that the 
Hawaiian plume may have been active since approximately 140 Myr 
ago, longer than previously recognized, and that the remnants of this 
activity would have been subducted beneath the Kamchatka penin- 
sula”’. Even though there is robust evidence for the overall stability of 
the African LLSVP for times when Africa was relatively stationary and 
distant from migrating subduction zones”, our results demonstrate 
that a tectonic setting of fast-moving oceanic plates surrounded by a 
migrating, dynamic set of subduction zones can lead to substantial 
LLSVP deformation and hotspot mobility. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS 


The numerical model. We consider models of thermochemical convection within 
Earth’s mantle under the extended-Boussinesq approximation*! in a spherical shell, 
with depth-dependent thermodynamic properties and temperature- and depth- 
dependent rheology. The equations for the conservation of mass, momentum and 
energy are solved using CitcomS as a finite-element problem, which has been 
modified to allow for progressive assimilation of surface plate motion and inferred 
slab material based on global plate reconstructions’’. Our choice of model param- 
eters and underlying assumptions have been outlined in earlier work!!! Extended 
Data Table 1 lists parameters held fixed across all model cases and additional details 
can be found in our earlier work’. 
Model setup. The spherical shell representing Earth’s mantle is decomposed into 
about 12.6 million mesh elements. Radial mesh refinement provides a vertical 
resolution of about 15 km and about 27 km near the top and bottom boundary 
layers, respectively. We assume an a priori mantle adiabat augmented by thermal 
boundary layers. The top and the bottom thermal boundary layers each encompass 
a temperature drop of 1,225 K and the initial adiabatic temperature profile has a 
potential temperature of 1,525 K. A non-dimensional internal heat generation rate 
of 100 is applied in all model cases. We compute a reference profile for thermal 
expansion, α, based on analytical parameterizations given in ref. 33, using the a 
priori mantle temperature profile. 

We use piecewise Arrhenius laws to describe the variation of viscosity with 
temperature, depth and composition in Earth’s mantle, which takes the following 
non-dimensional form: 


$$
\eta(T, r) = A(r)\,\eta_C \exp\left[ \frac{E_\eta(r) + (1 - r)\,V_\eta(r)}{T + T_\mathrm{off}} - \frac{E_\eta(r) + (1 - r_\mathrm{inner})\,V_\eta(r)}{1 + T_\mathrm{off}} \right] \tag{1}
$$

where $\eta$ is the viscosity, $T$ is the temperature, $r$ is the radius, $A$ is the pre- 
exponential parameter, $\eta_C$ is the intrinsic composition-dependent pre-factor, $E_\eta$ is 
the activation energy, $V_\eta$ is the activation volume, $T_\mathrm{off}$ is the temperature offset and 
$r_\mathrm{inner}$ is the radius at the CMB. The second term within the exponential ensures that 
it reduces to 1 when $T = 1$ and $r = r_\mathrm{inner}$. For the lower mantle, we use a dimensional 
activation energy of 320 kJ mol⁻¹ and activation volume of 6.7 × 10⁻⁶ m³ mol⁻¹, 
corresponding to non-dimensional units of 11 and 26, respectively, comparable 
to estimates in ref. 34. However, such viscosity parameterizations lead to large 
viscosity variations, causing numerical difficulties. To limit the viscosity contrast 
to three orders of magnitude, we adjust the pre-exponential parameter A(r) and 
the temperature offset $T_\mathrm{off}$ (ref. 35). The resulting viscosity profile is similar to the 
preferred viscosity profiles of ref. 23. Additional details on model setup can be 
found in our earlier work". 
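
As an illustration only, the viscosity law can be sketched in a few lines of Python. The sketch assumes the exponent is the difference of two Arrhenius terms, the second being the first evaluated at T = 1 and r = r_inner, as the text describes. The default values of `T_off` and `r_inner`, and the function name itself, are placeholders chosen for this example; they are not taken from the paper.

```python
import math

def viscosity(T, r, A=1.0, eta_c=1.0, E=11.0, V=26.0, T_off=0.5, r_inner=0.55):
    """Non-dimensional Arrhenius viscosity (illustrative sketch).

    E and V default to the non-dimensional lower-mantle values quoted in
    the text (11 and 26). T_off and r_inner are assumed placeholder values.
    """
    # Temperature- and depth-dependent Arrhenius term.
    first = (E + (1.0 - r) * V) / (T + T_off)
    # Same term evaluated at T = 1, r = r_inner, so the exponential is 1 there.
    second = (E + (1.0 - r_inner) * V) / (1.0 + T_off)
    return A * eta_c * math.exp(first - second)

# At the reference state the viscosity equals the pre-factor A * eta_c.
reference = viscosity(1.0, 0.55)
# Colder material at the same radius is more viscous.
colder = viscosity(0.5, 0.55)
```

With these placeholder values, `reference` equals the pre-factor product and `colder` exceeds it, which is the qualitative behaviour an Arrhenius law should show.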

Initial and boundary conditions. We apply kinematic surface boundary con- 
ditions that are a function of both the relative plate motion model and the abso- 
lute plate motion model. We use the relative plate motion from ref. 36, updated 
for circum-Arctic regions as described in ref. 37. In most model cases we use 
the absolute plate motion model of ref. 30 for the period 0-70 Myr ago, and the 
absolute plate motion model of ref. 38 for the period 105-200 Myr ago, with an 
interpolation between 70 Myr ago and 105 Myr ago. We refer to this hybrid absolute 
plate motion model as APM1 in Extended Data Table 2. We also test a kinematic 
scenario using the same relative plate motion model combined with an absolute 
plate motion model constrained by lower mantle slab remnants”, which we refer 
to as APM2 in Extended Data Table 2. Surface velocities derived from these global 
plate tectonic reconstructions are obtained in 1-Myr intervals with a linear inter- 
polation in between. 

We constructed a thermal model of slabs from the reconstructed location and 
age of oceanic lithosphere at convergent plate margins. These slabs are initially 
inserted from the surface to a depth of 1,200 km. Slabs that appear during mod- 
elled geological time are progressively inserted into the upper mantle on the basis 
of their age of appearance. This imposed thermal structure is blended with the 
dynamically evolving temperature field at each timestep—see Bower et al.'"'° for 
a more detailed description of the progressive data assimilation method. This sim- 
ple approach captures the essential aspects of subduction by injecting slabs with 
realistic thickness and mass flux, without requiring complex rheological laws to 
model the physics of plate boundaries in our models. 

To evaluate the influence of parameter values and initial conditions we have 
computed a range of models in which we varied the density contrast of the dense 
chemical layer above the CMB, the intrinsic viscosity contrast of the dense mate- 
rial, the initial geometry of the dense layer and the plate reconstruction that 
dictates surface plate velocities and the location of subduction zones. Extended 
Data Table 2 lists the model cases presented in this study. In most model cases, 
an initially uniform layer of anomalously dense material about 100 km thick cov- 
ers the CMB. However, before the Mesozoic era, the dense LLSVP material may 
have already been displaced and deformed by slabs!*”. To test the influence of 
an already thickened dense layer on model outcomes, we present a model case 
(M5 in Extended Data Table 2) where the dense material is confined to two slightly 
elongated domes—resembling present-day LLSVPs—in the initial condition. These 
domes are about 1,000 K hotter than the ambient mantle!! and have a relief of 
about 450 km, with their combined volume comparable to estimates of the volume 
of LLSVPs}’. 

Poloidal flow in the lower mantle. The velocity field u in our global models is 
divergence free: 


$$\nabla \cdot \mathbf{u} = 0 \tag{2}$$


This allows us to decompose the tangential velocity field on each radial shell of the 
spherical mesh into poloidal and toroidal components. The poloidal flow originates 
from buoyancy-driven convergent or divergent flows, such as at spreading centres 
and subduction zones. Following from Helmholtz’s theorem on spheres“: 


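$$\mathbf{V}_t = \nabla_s \Phi + \nabla_s \times (\Psi\,\hat{\mathbf{r}}) \tag{3}$$


where $\mathbf{V}_t$ is the tangential velocity on a radial shell of the computational domain, 
$\nabla_s$ is the gradient operator on a spherical shell, $\Phi$ is the poloidal potential, $\Psi$ is the 
toroidal potential and $\hat{\mathbf{r}}$ is the radial unit vector. See ref. 40 for more details. The 
poloidal potential $\Phi$ is obtained as follows: 


$$\Phi = \nabla_s^{-2}\,(\nabla_s \cdot \mathbf{V}_t) \tag{4}$$

Subsequently, the poloidal flow field $\mathbf{V}_p$ on a radial shell is obtained as: 

$$\mathbf{V}_p = \nabla_s \Phi \tag{5}$$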
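
The step from equation (3) to equation (4) can be made explicit by taking the surface divergence of equation (3). The lines below are a sketch, assuming the notation $\mathbf{V}_t$ (tangential velocity), $\nabla_s$ (surface gradient), $\Phi$ (poloidal potential) and $\Psi$ (toroidal potential); these symbol choices are an assumption of the sketch and may differ from the original typesetting. The toroidal term drops out because the surface curl of a radial field is divergence free on the sphere.

$$
\nabla_s \cdot \mathbf{V}_t
= \nabla_s^2 \Phi + \nabla_s \cdot \left[ \nabla_s \times (\Psi\,\hat{\mathbf{r}}) \right]
= \nabla_s^2 \Phi
\quad\Longrightarrow\quad
\Phi = \nabla_s^{-2}\,(\nabla_s \cdot \mathbf{V}_t)
$$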


Model hotspot tracks and plume tilt. A detailed account of a scheme for detecting 
plumes in global mantle convection models is given in ref. 10. We process model 
outputs at 5-Myr intervals, since the transit time of a plume head from nucleation 
to eruption under the base of the lithosphere is always >5 Myr. We compute a set 
of extant plume conduit locations, $S_t = \{P_1, \ldots, P_n\}$, at a depth of 350 km, based on 
the plume detection scheme, where subscript n denotes the total number of plumes 
detected at a given model time t. Age-progressive model hotspot tracks in the 
mantle frame of reference are subsequently identified by binning conduit locations 
from the ensemble set $S = \{S_{t_0}, \ldots, S_{t_p}\}$ based on spatiotemporal proximity, where 
$t_0$ is the model time when the first model plumes appear and $t_p$ is the model time 
corresponding to the present day. 

A fourth-order Runge-Kutta scheme is used to compute streamlines passing 
through each $P_i \in S_t$ and spanning mantle depths of 350-1,500 km that mark the 
topological skeletons of extant plume conduits at a given model time. We limit the 
depth extent of these streamlines to 1,500 km, since plumes are rooted on chemical 
ridges that can reach more than about 1,000 km above the CMB. Plume tilt is then 
computed based on $P_i$ and its corresponding location at 1,500 km depth, relative 
to the radial normal. 
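
The streamline and tilt calculation can be sketched as follows. This is a simplified illustration, not the procedure used in the study: it works in a flat Cartesian box (z is height in km, so a depth of 350 km is z = -350) rather than spherical geometry, measures tilt from the vertical rather than the radial normal, and uses a hypothetical uniform velocity field, `unit_velocity`, in place of model output.

```python
import math

def rk4_step(f, p, h):
    """Advance point p by step h along vector field f (classical RK4)."""
    k1 = f(p)
    k2 = f([pi + 0.5 * h * ki for pi, ki in zip(p, k1)])
    k3 = f([pi + 0.5 * h * ki for pi, ki in zip(p, k2)])
    k4 = f([pi + h * ki for pi, ki in zip(p, k3)])
    return [pi + h / 6.0 * (a + 2 * b + 2 * c + d)
            for pi, a, b, c, d in zip(p, k1, k2, k3, k4)]

def unit_velocity(p):
    """Hypothetical conduit flow: upward with a constant horizontal drift."""
    v = [0.2, 0.0, 1.0]  # (x, y, z) components, arbitrary units
    n = math.sqrt(sum(c * c for c in v))
    return [c / n for c in v]

def trace_down(f, start, z_stop, h=-10.0, max_steps=10000):
    """Trace a streamline backwards (negative h) until z <= z_stop."""
    path = [list(start)]
    for _ in range(max_steps):
        if path[-1][2] <= z_stop:
            break
        path.append(rk4_step(f, path[-1], h))
    return path

def tilt_degrees(top, bottom):
    """Angle between the top-to-bottom chord and the vertical, in degrees."""
    dx, dy, dz = (t - b for t, b in zip(top, bottom))
    return math.degrees(math.atan2(math.hypot(dx, dy), dz))

path = trace_down(unit_velocity, [0.0, 0.0, -350.0], -1500.0)
tilt = tilt_degrees(path[0], path[-1])
```

For this uniform field the tilt is simply the arctangent of the ratio of horizontal to vertical velocity; with real model velocities the streamline curves and the chord between the two end points gives an average tilt.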

Predicted hotspot tracks. The motion of a model plume in the mantle frame of 
reference can be represented implicitly by a sequence of finite rotations on the 
sphere, decoupling it from explicit spatial locations. By applying such motion 
trajectories obtained from convection models, such as Hm (Fig. 1a), to the 
corresponding observed plume (Hawaii), a direct comparison can be made between 
the predicted and the observed hotspot tracks. 
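
A minimal sketch of applying a sequence of finite rotations to a point on the unit sphere is given below, using Rodrigues' rotation formula. The function names and the idea of passing stage rotations as (pole, angle) pairs are assumptions made for illustration; the study's own implementation is not described at this level of detail.

```python
import math

def to_xyz(lat, lon):
    """Convert latitude/longitude in degrees to a unit vector."""
    la, lo = math.radians(lat), math.radians(lon)
    return (math.cos(la) * math.cos(lo), math.cos(la) * math.sin(lo), math.sin(la))

def to_latlon(v):
    """Convert a unit vector back to latitude/longitude in degrees."""
    x, y, z = v
    lat = math.degrees(math.asin(max(-1.0, min(1.0, z))))
    return lat, math.degrees(math.atan2(y, x))

def rotate(point, pole, angle_deg):
    """Rotate (lat, lon) about an Euler pole (lat, lon) by angle_deg."""
    v, k = to_xyz(*point), to_xyz(*pole)
    a = math.radians(angle_deg)
    kxv = (k[1] * v[2] - k[2] * v[1],
           k[2] * v[0] - k[0] * v[2],
           k[0] * v[1] - k[1] * v[0])
    kdv = sum(ki * vi for ki, vi in zip(k, v))
    # Rodrigues' formula: v cos(a) + (k x v) sin(a) + k (k . v)(1 - cos(a))
    r = tuple(vi * math.cos(a) + ci * math.sin(a) + ki * kdv * (1.0 - math.cos(a))
              for vi, ci, ki in zip(v, kxv, k))
    return to_latlon(r)

def track(start, stage_rotations):
    """Apply stage rotations [(pole, angle_deg), ...] in order from start."""
    points = [start]
    for pole, angle in stage_rotations:
        points.append(rotate(points[-1], pole, angle))
    return points
```

Because the motion is stored as rotations rather than coordinates, the same list of stage rotations can be applied to any starting point, for example the present-day location of the observed hotspot.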

Comparison of model LLSVPs with tomography. Extended Data Fig. 1 shows 
a comparison of present-day model LLSVPs for case M3 (Extended Data Table 2) 
with the Savani tomography model". The first-order shapes of both the African 
and Pacific LLSVPs are largely in agreement with our model. The western extension 
of the Pacific LLSVP is much smaller in north-south extent than in the centre, 
consistent with our model. The Pacific LLSVP in the east is broken up into blobs, 
where the details are a little different compared to our model, but it shows that in 
the east this feature is more discontinuous. However, model LLSVP edges in the 
vicinity of the actual and model Hawaiian plumes agree well with the tomography 
(Extended Data Fig. 1). 

In the African hemisphere, the tomography model shows the long 
east-west-oriented eastern arm of our African model LLSVP in the southern central 
Indian Ocean, thickening to the west as it straddles South Africa, and a more 
discontinuous northern extension where we have a narrow arm stretching towards 
Iceland. The ‘Perm Anomaly’ beneath present-day western Siberia appears as 
a rounded feature in the tomography model, as opposed to the more elongate 
corresponding feature in our model—though both are isolated and separated 
from the northern part of the African LLSVP. This comparison suggests that 
our models do a reasonable job of capturing lower mantle dynamics—especially 
considering the relative simplicity of the models—and it is not expected that 
model LLSVP edges would match higher-order geometric features found in man- 
tle tomography models. 
Extended Data Figure 2 | Evolution of mean poloidal flow. In each panel, 
the magnitude of mean poloidal velocity in a 300-km-thick shell above 
the CMB is shown in grey shading and corresponding flow directions are 
shown by black arrows at the age labelled, for case M3 (Extended Data 
Table 2). Edges of the model LLSVPs are marked by contours of the 75% 
chemical concentration isosurface, at labelled heights above the CMB. 
Subduction zones are shown in yellow and the white rectangular region 
marks the extent of the three-dimensional plots in Extended Data Fig. 3. 
In the bottom row of each panel, cross-sections along cyan profiles 
through the Pacific LLSVP show the evolution of its edges driven by 
subduction-induced flow. Velocity vectors in these cross-sections have 
been clipped to 6 cm yr⁻¹ and the black contours show 75% chemical 
concentration. 
Extended Data Figure 3 | Trajectory of model Hawaiian plume. Three- 
dimensional (Cartesian projection of spherical geometry) perspectives 
showing the southward motion and evolution of tilt for model plume 
corresponding to Hawaii (Hm) in case M3 (Extended Data Table 2). 
The black contour marks the 75% chemical concentration isosurface 
100 km above the CMB. The temperature field above layer averages, δT, is 
isosurfaced at a value of 0.1 to delineate plume conduits. The top 200 km of 
the domain is not rendered, in order to avoid visual clutter. 
Extended Data Table 1 | Physical parameters and constants 


Parameter                    Symbol   Value          Units 
Rayleigh number              Ra       5 × 10⁸        - 
Earth radius                 R0       6371           km 
Density                      ρ0       3930           kg m⁻³ 
Thermal expansivity          α0       1.42 × 10⁻⁵    K⁻¹ 
Thermal diffusivity          κ0       1 × 10⁻⁶       m² s⁻¹ 
Specific heat capacity       Cp       1100           J kg⁻¹ K⁻¹ 
Gravitational acceleration   g        10             m s⁻² 
Surface temperature          Ts       300            K 
Dissipation number           Di       0.8            - 
Reference viscosity          η0       1 × 10²¹       Pa s 
Internal heating             H        100            - 
Extended Data Table 2 | Model cases 


Case   Δρch (%)   APM model   ηC   Geometry of dense layer 
M1     2.125      APM1        1    Uniform 
M2     2.25       APM1        1    Uniform 
M3     2.5        APM1        1    Uniform 
M4     2.5        APM2        1    Uniform 
M5     2.5        APM1        1    Domed 
M6     2.5        APM1        5    Uniform 
M7     3.0        APM1        1    Uniform 


Δρch is the density contrast of the dense chemical layer above the CMB. 
First North American fossil monkey and early 
Miocene tropical biotic interchange 


Jonathan I. Bloch, Emily D. Woodruff, Aaron R. Wood, Aldo F. Rincon, Arianna R. Harrington, Gary S. Morgan, 
David A. Foster, Camilo Montes, Carlos A. Jaramillo, Nathan A. Jud, Douglas S. Jones & Bruce J. MacFadden 


New World monkeys (platyrrhines) are a diverse part of modern 
tropical ecosystems in North and South America, yet their early 
evolutionary history in the tropics is largely unknown. Molecular 
divergence estimates suggest that primates arrived in tropical 
Central America, the southern-most extent of the North American 
landmass, with several dispersals from South America starting 
with the emergence of the Isthmus of Panama 3-4 million years 
ago (Ma)!. The complete absence of primate fossils from Central 
America has, however, limited our understanding of their history 
in the New World. Here we present the first description of a fossil 
monkey recovered from the North American landmass, the oldest 
known crown platyrrhine, from a precisely dated 20.9-Ma layer in 
the Las Cascadas Formation in the Panama Canal Basin, Panama. 
This discovery suggests that family-level diversification of extant 
New World monkeys occurred in the tropics, with new divergence 
estimates for Cebidae between 22 and 25 Ma, and provides the 
oldest fossil evidence for mammalian interchange between South 
and North America. The timing is consistent with recent tectonic 
reconstructions~® of a relatively narrow Central American Seaway 
in the early Miocene epoch, coincident with over-water dispersals 
inferred for many other groups of animals and plants*. Discovery 
of an early Miocene primate in Panama provides evidence for a 
circum-Caribbean tropical distribution of New World monkeys by 
this time, with ocean barriers not wholly restricting their northward 
movements, requiring a complex set of ecological factors to explain 
their absence in well-sampled similarly aged localities at higher 
latitudes of North America. 

The Miocene epoch (23.8-5.3 Ma) is marked by substantial climatic 
and ecological changes that had profound effects on terrestrial mam- 
mal communities in the New World tropics”. Fossils from the tropical 
lowlands of Central America are rare owing to a lack of relevant rock 
exposures; however, an important exception can be found in Panama 
where, since 2009, expansion of the Panama Canal has exposed fossil- 
bearing rocks of early Miocene age. The lower Miocene Las Cascadas 
Formation (Fig. 1) represents the oldest fossiliferous continental 
sequence exposed along the Panama Canal area’ and includes a diverse 
fossil mammal assemblage*? that, while compositionally different 
from that of the younger overlying Centenario Fauna"®, similarly has 
almost entirely North American affinities despite its proximity to South 
America, consistent with a peninsular connection to North America!®!!, 
Palaeontological fieldwork in 2012-2013 resulted in the discovery of 
seven isolated fossil primate teeth that shed new light on the early 
evolution of crown New World monkeys in the tropics and pro- 
vide insight into the timing and dynamics of the earliest stages of the 
Great American Biotic Interchange (GABI) between North and South 
America’. 


Primates Linnaeus, 1758 
Anthropoidea Mivart, 1864 
Platyrrhini Geoffroy, 1812 
Cebidae Bonaparte, 1831 
Panamacebus transitus gen. et sp. nov. 


Etymology. Generic name combines ‘Panama’ with ‘Cebus’, root taxon 
for Cebidae. Specific name ‘transit’ (Latin, crossing) refers to its implied 
early Miocene dispersal between South and North America. 
Holotype. UF 280128, left upper first molar (M1; Fig. 2a, b). 
Referred material. Left upper second molar (M2; UF 281001; Fig. 2a, b), 
partial left lower first incisor (I1; UF 280130), right lower second 
incisor (I2; UF 267048), right lower canine (C1; UF 280131), possibly 
associated left lower second (P2; UF 280127) and fourth (P4; UF 
280129) premolars (Fig. 2e-g). 

Locality. Lirio Norte (site key YPA-024 in the Florida Museum of 
Natural History Vertebrate Paleontology Collection), Panama Canal 
area, Panama, Central America (Extended Data Fig. 1). 

Age and horizon. Primate fossils were collected from a single horizon 
of sub-aerially exposed ash fall deposits in the upper part of the Las 
Cascadas Formation (Fig. 1a). Magmatic zircons from a non-subaerially 
exposed andesitic tuff, ~0.25 m below the primate-bearing horizon 
(Extended Data Fig. 2), yield an age of 20.93 ± 0.17 Ma (2σ) (Fig. 1b and 
Supplementary Table 1), interpreted to be the eruption age of the tuff and 
a close approximation of the absolute age of the primate-bearing horizon. 
The mammalian assemblage (Supplementary Table 6) recovered from 
this horizon is consistent with what is known from the late Arikareean 
Ar4 faunal zone (22.8-19.05 Ma; refs 10, 13) North American Land 
Mammal Age (NALMA) at higher latitudes!*!4 (Extended Data Fig. 3). 
See also Supplementary Information (results section). 
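
For readers unfamiliar with how a set of single-zircon dates is combined into one age, the sketch below shows a standard inverse-variance weighted mean together with the mean square of weighted deviates (MSWD). It is a generic illustration, assuming independent 1σ uncertainties; the function name and any input values are hypothetical, and the paper's own data reduction is described in its Methods and Supplementary Information.

```python
import math

def weighted_mean_age(ages, sigmas):
    """Inverse-variance weighted mean, its 1-sigma error, and the MSWD.

    ages and sigmas are equal-length sequences (same units, 1-sigma errors).
    """
    weights = [1.0 / s ** 2 for s in sigmas]
    total = sum(weights)
    mean = sum(w * a for w, a in zip(weights, ages)) / total
    sigma_mean = math.sqrt(1.0 / total)
    # MSWD: weighted sum of squared residuals per degree of freedom.
    mswd = sum(w * (a - mean) ** 2 for w, a in zip(weights, ages)) / (len(ages) - 1)
    return mean, sigma_mean, mswd
```

An MSWD close to 1 indicates that the scatter of the individual dates is consistent with their stated uncertainties; values well above 1 suggest excess scatter.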

Diagnosis. Medium-sized cebid platyrrhine (~2.7 kg) that differs 
from all other cebines in having lower incisors (I1, I2) that are higher 
crowned with an incomplete lingual cingulum, and a first upper 
molar (M1) that is considerably larger than the second upper molar 
(M2). Further differs from most cebines: (except Aotus) in having 
an I1 cross-sectional area only somewhat smaller than that of I2; 
(except Cebus) in having somewhat inflated lower premolars (P2, P4), 
and a P4 talonid and trigonid of similar length; and (except Saimiri 
and Neosaimiri) in having M1-2 with a short and strong anterior 
cingulum. Further differs: from Cebus and Acrecebus in having M1-2 
with a mesiobuccally oriented prehypocrista, lacking a metaconule, a 
postprotocrista that is continuous with a hypometacrista that reaches 
to the base of the metacone, a sharp and distinct hypometacrista, 
and lack of lingual inflation of the protocone; from Cebus in having 
a P4 with a smaller hypoconid; and from Saimiri in lacking a 
pericone on M1-2. Differs from all callitrichines in having a discrete 


1Florida Museum of Natural History, University of Florida, Gainesville, Florida 32611-7800, USA. 2Department of Biology, University of Florida, Gainesville, Florida 32611-7800, USA. 3Department 
of Geological and Atmospheric Sciences, Iowa State University, Ames, Iowa 50011-1027, USA. 4Department of Geological Sciences, University of Florida, Gainesville, Florida 32611-7800, USA. 
5Department of Evolutionary Anthropology, Duke University, Durham, North Carolina 27708-9976, USA. 6New Mexico Museum of Natural History and Science, Albuquerque, New Mexico 87104, 
USA. 7Geociencias, Universidad de los Andes, Calle 1A # 18A-10, Edificio IP, Bogota DC 111711, Colombia. 8Smithsonian Tropical Research Institute, Box 0843-03092, Balboa, Ancon, Republic of 
Figure 1 | Stratigraphy of the primate-bearing locality (YPA-024) in 
central Panama. a, Measured stratigraphic section (in metres) in the 
Las Cascadas Formation showing the positions of the dated rock sample 
(asterisk) and of the Panamacebus fossils as a silhouette of a monkey (right), 
correlated to a schematic stratigraphic position of the Lirio Norte Local 
Fauna and the Centenario Fauna with previously published radiometric 
dates (a, ref. 3; b, ref. 28) indicated (centre), and North American land 
mammal faunal zonation (left). Biochronol., Biochronology; Fm., 
formation; Hemingf., Hemingfordian North American Land Mammal Age; 
MSWD, mean square of weighted deviates. b, Results from U-Pb isotopic 
analyses plotted as a weighted mean of the 206Pb/238U ages from 37 zircons 
(see Methods and Supplementary Information). 


entoconid on P4, and M1-2 with a large hypocone and lacking a 
buccal cingulum; from callitrichines (but also Aotus) in having an 
M2 with a buccally expanded paracone; and from callitrichines 
(but also Saimiri and Neosaimiri) in having M1-2 with a well-developed 
prehypocrista that extends to the postprotocrista and encloses the talon 
lingually. For additional description and metrics see Extended Data 
Figs 4-7 and Supplementary Information (results section). 

Panamacebus is similar to other platyrrhines in having a single-rooted 
P, rather than double-rooted as in catarrhines. Among cebids, the 
dental morphology is most similar to that of fossil cebines, including 
middle Miocene Neosaimiri from Colombia (Fig. 2 and Extended Data 
Figs 4-7). For additional comparisons see Supplementary Information 
(results section) and Supplementary Figs 2 and 3. 

Until recently, the oldest evidence for the arrival of primates in 
South America from the Old World was from the late Oligocene 
epoch (26 Ma) of Bolivia!®. The recent discovery of anthropoids from 
eastern Peru’® suggests an earlier arrival into tropical South America, 
perhaps during the late Eocene epoch (~37-34 Ma), although the date 
is uncertain'”. Dispersals from Africa or Asia through Antarctica or 
North America have been suggested", but ‘rafting’ on floating islands 
from Africa across the Atlantic is considered the most likely mechanism 
Figure 2 | Comparison of Panamacebus with middle Miocene cebid 
Neosaimiri fieldsi from La Venta, Colombia. a, b, Occlusal (a) and 
lingual (b) views of P. transitus left M1 (UF 280128: 
http://dx.doi.org/10.17602/M2/M8531) and M2 (UF 281001: 
http://dx.doi.org/10.17602/M2/M8550). c, d, Occlusal (c) and lingual (d) views of N. fieldsi right M1 
(IGM-KU 89008, mirrored: http://dx.doi.org/10.17602/M2/M8541) and 
M2 (IGM-KU 89018, mirrored: http://dx.doi.org/10.17602/M2/M8543). 
e-g, Occlusal (e), lingual (f) and buccal (g) views of P. transitus partial 
left I1 (UF 280130: http://dx.doi.org/10.17602/M2/M8400), right I2 (UF 
267048, mirrored: http://dx.doi.org/10.17602/M2/M8395), right C1 (UF 
280131, mirrored: http://dx.doi.org/10.17602/M2/M8401), left P2 (UF 
280127: http://dx.doi.org/10.17602/M2/M8397) and left P4 (UF 280129: 
http://dx.doi.org/10.17602/M2/M8399). h-j, Occlusal (h), lingual (i) and 
buccal (j) views of N. fieldsi left dentary with -M) (UCMP 39205: http:// 
dx.doi.org/10.17602/M2/M1879, http://dx.doi.org/10.17602/M2/M1880). 
Images generated from microCT scan data (see Methods and links 
associated with specimen numbers). Scale bars: 1 mm (a-d); 5mm (e-j). 


for their arrival into South America’. Results from phylogenetic 
analyses of new morphological data coded into a previously published 
matrix! vary with the use of different molecular constraints!°, but 
both support Panamacebus within crown Platyrrhini, specifically 
within Cebidae (Fig. 3; Supplementary Information (results 
section) and Supplementary Figs 4-10), suggesting a northward 
early Miocene dispersal across the Central American Seaway (CAS) 
from South to North America, perhaps associated with the onset of 
extensive development of terrestrial landscapes in Central America 
as a consequence of the initial collision with South America”? (Fig. 4 
and Extended Data Fig. 8). 

Using our most resolved tree (Fig. 3a), the minimum age for a split 
between Callitrichinae and Cebinae can be calibrated with the radio- 
isotopic date (20.77-21.90 Ma, posterior probability (PP): 1.0). The 
results of our divergence dating analysis (Extended Data Figs 9, 10 and 
Supplementary Information (results section)) using this date as a prior 
(Supplementary Table 2), considerably narrow previously reported 
range estimates (16.07-23.5 Ma; ref. 20; and 15.66-24.03 Ma; ref. 21). 
Our inferred divergence of Cebidae from Atelidae (21.84-24.93 Ma, 
PP: 0.99) also greatly reduces the interval estimated in a previous study 
(18.14-26.11 Ma; ref. 20). Our divergence estimates for other primate 
clades are generally congruent with previous studies (Supplementary 
Information (results section)). Panamacebus does not seem to share a 
[Figure 3 tree diagrams, panels a and b. Clade labels: Callitrichinae, Cebinae, Atelidae, Pitheciidae. Tip labels recovered from the figure include: Cebuella, Callithrix, Callimico, Lagonimico†, Aotus dindensis†, Mohanamico†, Cebus, Acrecebus†, Panamacebus†*, Neosaimiri†, Aotus, Brachyteles, Lagothrix, Cacajao, Pithecia, Nuciruptor†, Proteropithecia†, Callicebus, Xenothrix†, Antillothrix†, Paralouatta†, Dolichocebus†, Chilecebus†, Tremacebus†, Carlocebus†, Homunculus†, Mazzonicebus†, Soriacebus†, Branisella†.] 


Figure 3 | Results from phylogenetic analyses showing Panamacebus within crown Platyrrhini. a, Strict consensus of two most parsimonious trees 
analysed using molecular constraint 1 (ref. 1). b, Strict consensus of four most parsimonious trees analysed using molecular constraint 2 (ref. 20). Circles 
at nodes correspond to bootstrap support values: orange, greater than 85; blue, 50-85; black, less than 50. Daggers indicate fossil (extinct) taxa. Asterisk 
indicates P. transitus. 


close relationship with fossil primates from the Quaternary period of 
Cuba, Jamaica and Hispaniola (Fig. 3a, b), although they probably also 
dispersed from tropical South America into the Greater Antilles by the 
early Miocene’. This result suggests two unrelated migrations from 
South America to both tropical North America (a crown platyrrhine) 
and the Caribbean islands (probable stem platyrrhines) during the 
Oligocene/early Miocene epochs. 

While attribution of fossils to crown versus stem platyrrhines has 
been the subject of substantial ongoing debate”, our results support 
an exclusively tropical crown platyrrhine radiation with only the 
more distantly related (stem) platyrrhines distributed into the higher 
latitudes of South America’. However, if the alternative view proves 
to be correct, that primates of similar age in Patagonia are also crown 
platyrrhines (the long-lineage hypothesis)**, Panamacebus would 
still be at least equal in age to the oldest disputed crown platyrrhine. 
Coupled with fossil evidence for primates in the lower Miocene of 
the Greater Antilles”*, this new record in Central America shows that 
primates were dispersing northward out of South America with a previ- 
ously unrecognized early Miocene circum-Caribbean distribution. The 
primate record is consistent with new tectonic and palaeogeographic 
reconstructions”® of a relatively narrow CAS in the early Miocene 
and corresponding dispersals inferred on the basis of molecular and 
fossil data for many terrestrial organisms, including amphibians, 
reptiles, freshwater fishes, insects and plants*. In this context, it is 
perhaps surprising to not also find fossil evidence of caviomorph 
rodents and sloths in the early Miocene of Panama since they are found 
in a slightly younger locality that includes evidence of a primate in the 
Greater Antilles”*. Absence of primates from the overlying Centenario 
Fauna", which is ~2 million years (Myr) younger than the Lirio Norte 
Local Fauna, might suggest a single waif dispersal event of a short-lived 
population. Alternatively, with increased collecting efforts the fossil 
record may yet provide evidence for other earlier mammal dispersals 
out of South America, and/or primates having a longer history in trop- 
ical North America than is currently known. 

Prior to this discovery, the oldest fossil evidence for mammalian 
dispersal from South to North America is 8.5-9 Ma mylodontid and
megalonychid sloths that, along with later immigrants associated with
different pulses of the GABI, would have necessarily had to move 
through the tropics, but also quickly dispersed to higher latitudes. By 
contrast, New World monkeys clearly crossed into the tropical lowlands 
of Central America at least once by 21 Ma, but there is no record of 
them in localities of similar age at higher northern latitudes. This is 
especially notable in the Gulf Coastal Plain, where most other mammals 




Figure 4 | Palaeogeographic reconstruction showing hypothetical 
dispersal route of Panamacebus across the CAS in the early Miocene. 
Yellow and ochre colours indicate subaerial environments, blue colours 
indicate marine environments (dark, coastal and platform; light, abyssal). 
Criteria used to arrive at this reconstruction include regional tectonic 
reconstructions, local and regional palaeomagnetic data, and regional 
strain markers and piercing points (see Extended Data Fig. 8, Methods, 
and Supplementary Methods). Fm., formation; Fms, formations. 
share close affinities with those found in Panama®”. Absence of early
Miocene New World monkeys at higher northern latitudes could be 
explained by the limited extent of suitable tropical forest habitats, much 
like today. However, their distribution in South America, including very 
high latitudes of Patagonia, at localities of very similar age”, introduces 
a potential paradox in which primates would be limited to tropical 
forests in North but not South America”®. A possible resolution is 
found in the taxonomic composition and historical biogeography of 
the forests themselves. Early Miocene forests of tropical South America 
have a shared Gondwanan history with those at higher southern 
latitudes, including Patagonia, and southern Central America (Costa 
Rica and Panama), which are dominated by South American-derived 
tropical rainforest taxa”. Northern tropical Central American forests, 
however, have predominantly Laurasian affinities in the early Miocene 
(Supplementary Tables 3 and 7 and Supplementary Information (results 
section)). Thus, dispersal of New World monkeys further northward in 
the early Miocene was probably limited more by their niche conserva- 
tism and a boundary between forests with very different evolutionary 
histories than by differences in climate or the existence of major geo- 
graphical barriers. The New World tropics may have acted as a holding 
pen for some tropical mammals for at least 12 Myr before xenarthrans 
first appeared at higher latitudes of North America’’, suggesting that 
the distribution of South American-derived tropical forests played an 
important part in the early dynamics of the GABI. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS


Geochronology. Sample MD 11 was collected from an ash layer that forms the 
upper-most part of a several-metre thick, welded andesitic tuff. The tuff is exposed 
within the conformable section of the Las Cascadas Formation approximately 
0.25 m below the mammal fossil-bearing unit (Fig. 1). Zircons were separated 
from two 10-kg sample aggregates using conventional magnetic and density
concentration methods. Clear to light pink, euhedral zircons about 100 µm in length
were mounted in epoxy and ground to reveal internal surfaces. 

U-Pb isotopic analyses were conducted on 20-µm spots on zircon grains
using a Nu-Plasma multicollector plasma source mass spectrometer coupled to 
a New Wave 213-nm laser (LA-MC-ICP-MS). The data were acquired using the 
Nu-Instruments Time Resolved Analysis software. Data calibration and drift 
were based on multiple analyses of the reference zircon FC-1 (ref. 29) repeated 
between every 10 ablations of the unknowns. Concordant U-Pb data were yielded 
by 37 zircons (Supplementary Table 1). A weighted mean of the ²⁰⁶Pb/²³⁸U ages
gives 20.93 ± 0.17 Ma (2σ) (Fig. 1b), which we interpret to be the eruption age
of the andesitic tuff and the absolute age of the Las Cascadas mammal fossil
horizon (Supplementary Information (results section)). These data are consistent
with a ²⁰⁶Pb/²³⁸U age of 19.3 ± 0.4 Ma for a welded tuff unit within the younger
Culebra Formation®”. 
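
A weighted mean of this kind is commonly computed by inverse-variance weighting. The following is a minimal Python sketch under that assumption; it takes an age and a 1σ uncertainty for each analysis, and the three values used are illustrative placeholders, not measurements from Supplementary Table 1. The function name `weighted_mean_age` is hypothetical.

```python
import math

def weighted_mean_age(ages_ma, sigmas_ma):
    """Inverse-variance weighted mean and its 2-sigma uncertainty (both in Ma)."""
    if len(ages_ma) != len(sigmas_ma) or not ages_ma:
        raise ValueError("need equal-length, non-empty inputs")
    # Each analysis is weighted by the inverse of its variance.
    weights = [1.0 / s ** 2 for s in sigmas_ma]
    total = sum(weights)
    mean = sum(w * a for w, a in zip(weights, ages_ma)) / total
    # Standard error of the weighted mean (1 sigma), doubled for 2 sigma.
    sigma_mean = math.sqrt(1.0 / total)
    return mean, 2.0 * sigma_mean

# Hypothetical spot ages (Ma) with 1-sigma errors; not the published data.
ages = [20.8, 21.0, 21.1]
errors = [0.3, 0.3, 0.6]
mean, two_sigma = weighted_mean_age(ages, errors)
print(f"{mean:.2f} ± {two_sigma:.2f} Ma (2σ)")
```

More precise analyses pull the mean towards themselves, and the combined uncertainty shrinks as concordant grains are added.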

Three-dimensional data acquisition. Fossil platyrrhine specimens (UF 267048, 
28001, 280127-280131; UCMP 38762, 38989, 39205) and casts (IGM-KU 89008, 
89011, 89018, 89019, 89021, 89086, 89092, 89104, 90016) were scanned at the 
Duke University Shared Materials Instrumentation facility in Durham, North 
Carolina, using a Nikon XTH 225 ST MicroCT scanner (Supplementary Table 4). 
The resulting scans were reconstructed into tiff stacks and were imported into 
Avizo 7 (Visualization Sciences Group) for surface visualization (Fig. 2, Extended 
Data Figs 4-7 and Supplementary Figs 2, 3) and manipulation. Data were distrib- 
uted and shared using Duke University’s three-dimensional data archive (http:// 
www.morphosource.org). DOI links to three-dimensional data are provided in 
the figure captions. 

Comparative morphology. Comparisons were done within the morphological 
framework outlined and discussed previously’, with reference to the literature 
for fossil taxa as discussed in Supplementary Methods (see also table 1 in ref. 1 
and references therein). Additional comparisons were made to specimens and 
casts of fossil and extant platyrrhines. Three-dimensional images of relevant fossil 
platyrrhines were reconstructed from microCT scans (see earlier).

Phylogenetic analysis of morphological data. We conducted three phyloge-
netic analyses using maximum parsimony to examine the phylogenetic position 
of Panamacebus. The first analysis was conducted using the same parameters 
as the original analysis! with 177 ordered characters and all characters given a 
weight other than one (Supplementary Data 1). In the second analysis we used 
a different constraint tree derived from a recent molecular supermatrix?*”! 
(re-analyses of these data used for the constraint tree are described in 
Supplementary Methods). The third analysis was performed without enforcing 
a constraint tree. Support for the topology of the resulting trees was determined 
by bootstrapping (see Supplementary Methods for detailed descriptions of
phylogenetic analyses). 

Divergence dating recalibration analyses. Divergence dating analyses were 
conducted to determine the effect of a new minimum age for the divergence of 
Callitrichinae and Cebinae based on the radioisotopic date obtained for the horizon 
where the Panamacebus fossils were discovered. We took a conservative approach 
and calibrated the divergence of Cebinae and Callitrichinae as opposed to cal- 
ibrating a node within Cebinae as suggested by the placement of Panamacebus 
in the phylogeny using the first molecular constraint (Fig. 3a). Using a reduced 
molecular data set derived from ref. 20, we conducted two separate Bayesian 
Markov chain Monte Carlo (MCMC) analyses (see Supplementary Methods and 
Supplementary Data 3). Our analyses employed a lognormal relaxed molecular 
clock and a general time reversible (GTR) model with a gamma distribution, using 
four rate categories and estimated base frequencies. We set 15 calibration points 
(Supplementary Table 2 and Supplementary Data 4); 14 nodes were calibrated as 
described previously” and a 15th node was calibrated to the date corresponding 
to the age of Panamacebus (20.93 ± 0.17 Ma). A random starting tree was used
and analyses were conducted with an MCMC chain length of 200,000,000 states 
sampled every 10,000 states (for detailed description of divergence dating analyses 
see Supplementary Methods). 
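
For orientation, a chain of 200,000,000 states sampled every 10,000 states yields 20,000 retained samples per run, before any burn-in is discarded. A 95% highest posterior density interval of the kind reported for node ages can be taken as the narrowest window containing 95% of such samples. The sketch below is a minimal Python illustration using synthetic samples; the function name `hpd_interval` and the simulated ages are assumptions for illustration, not output from the analyses described here.

```python
import random

def hpd_interval(samples, mass=0.95):
    """Narrowest interval containing `mass` of the samples (unimodal case)."""
    xs = sorted(samples)
    n = len(xs)
    k = max(1, int(round(mass * n)))  # number of samples inside the interval
    # Slide a window of k consecutive sorted samples and keep the narrowest.
    best = min(range(n - k + 1), key=lambda i: xs[i + k - 1] - xs[i])
    return xs[best], xs[best + k - 1]

chain_length = 200_000_000
sample_every = 10_000
n_samples = chain_length // sample_every  # retained states per run

# Synthetic node ages (Ma); hypothetical values for illustration only.
rng = random.Random(0)
ages = [rng.gauss(21.0, 0.5) for _ in range(n_samples)]
low, high = hpd_interval(ages)
print(n_samples, round(low, 2), round(high, 2))
```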

Palaeogeographic reconstruction. Figure 4 and Extended Data Fig. 8 show a 
reconstruction built using fault-bound tectonic blocks restored to a late Oligocene- 
early Miocene palinspastic palaeogeographic position. Criteria used to arrive at 
this reconstruction include: (1) regional tectonic reconstructions; (2) local and 
regional palaeomagnetic data; and (3) regional strain markers and piercing points. 
Published stratigraphic columns with age constraints spanning the late Oligocene- 
early Miocene were placed in their corresponding palaeogeographic position. See 
Supplementary Methods for detailed methods and references. 

Palaeobotanical analysis of Miocene forests. We compiled pollen and spore 
occurrence data from nine different formations of Oligocene to middle Miocene age 
(Supplementary Table 3). The samples come from Florida, Puerto Rico, southern 
Mexico, Costa Rica and Panama; the number of samples per formation ranges from 
2 to 52 (Supplementary Table 7). We assigned the plant families and genera to 
coarse biogeographic categories on the basis of modern species distributions and 
fossil evidence. The categories are Gondwana-Amazonian, Gondwana-northern
Andean, Gondwana-southern Andean, Laurasian, or unassigned. Next, we 
calculated the percentage contribution of the biogeographic regions to the species 
(morphotype) richness in each fossil flora. Taxa that were not assigned to a 
biogeographic region were excluded from calculations of biogeographic affinity 
as described previously”. See Supplementary Methods for detailed references and 
discussion. 
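
The percentage calculation described above can be sketched as follows. This is a minimal Python illustration, assuming each morphotype carries a single category label and that unassigned taxa are dropped before percentages are taken; the taxon names and assignments are placeholders, and `affinity_percentages` is a hypothetical helper.

```python
from collections import Counter

def affinity_percentages(taxon_regions):
    """Percentage of assigned taxa in each biogeographic category.

    `taxon_regions` maps each taxon (morphotype) to a category name or to
    "unassigned"; unassigned taxa are excluded from the totals.
    """
    assigned = [r for r in taxon_regions.values() if r != "unassigned"]
    if not assigned:
        return {}
    counts = Counter(assigned)
    return {region: 100.0 * n / len(assigned) for region, n in counts.items()}

# Hypothetical flora; names and assignments are placeholders.
flora = {
    "taxon_A": "Gondwana-Amazonian",
    "taxon_B": "Gondwana-Amazonian",
    "taxon_C": "Laurasian",
    "taxon_D": "Gondwana-northern Andean",
    "taxon_E": "unassigned",
}
percentages = affinity_percentages(flora)
print(percentages)
```

Because unassigned taxa are removed from the denominator, the percentages for each flora sum to 100.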


29. Black, L. P. et al. The application of SHRIMP to Phanerozoic geochronology; a 
critical appraisal of four zircon standards. Chem. Geol. 200, 171-188 (2003). 

30. Montes, C. et al. Arc-continent collision and orocline formation: closing of the 
Central American seaway. J. Geophys. Res. 117, B4 (2012). 
Extended Data Figure 1 | Map of the southern part of the Panama Canal (Gaillard Cut). Black circles mark the position of the Lirio Norte Local Fauna
locality (YPA024) and other specific terrestrial vertebrate collecting sites (Centenario Fauna) in the area (modified from ref. 10). 
Extended Data Figure 2 | Photograph of the northern wall (south-facing) in the Lirio Norte Local Fauna locality (YPA024). Dated rock sample
MD11 was collected from an ash layer that forms the upper-most part of a several-metre thick, welded andesitic tuff. The tuff is exposed within the 
conformable section of the Las Cascadas Formation approximately 0.25 m below, and in close proximity to, the mammal fossil-bearing unit that includes 
the Panamacebus fossils. 
Extended Data Figure 3 | Generic-level biostratigraphy of selected taxa from the early Miocene (~21 Ma) Las Cascadas Formation, Panama.
Individual biochronologies were interpreted from temporal ranges known from higher latitudes. The calibration of the NALMA boundaries is as
described previously'?". 
Extended Data Figure 4 | Detailed upper molar comparisons
of Panamacebus to Neosaimiri. a-g, Left to right, occlusal,
distal, lingual, mesial and buccal views of the left M¹ (UF 280128:
http://dx.doi.org/10.17602/M2/M8531) (a) and the left M² (UF 281001:
http://dx.doi.org/10.17602/M2/M8550) (d) of P. transitus compared with
M¹ and M² of N. fieldsi (b, c, e-g): right M¹ (IGM-KU 89008:
http://dx.doi.org/10.17602/M2/M8541) (b), right M¹ (IGM 89019:
http://dx.doi.org/10.17602/M2/M8544) (c), right M²
(IGM-KU 89018: http://dx.doi.org/10.17602/M2/M8543) (e),
right M² (IGM-KU 89104: http://dx.doi.org/10.17602/M2/M8548) (f),
and right M² (IGM-KU 89011: http://dx.doi.org/10.17602/M2/M8542)
(g). Images were generated from microCT scan data (see Methods and
links associated with specimen numbers). Right-sided teeth were flipped
to facilitate comparison with left-sided teeth. Scale bar, 1 mm.




Extended Data Figure 5 | Detailed upper molar comparisons of
Panamacebus to Neosaimiri and Cebus. a-g, Left to right, occlusal, distal,
lingual, mesial and buccal views of the left M¹ (UF 280128:
http://dx.doi.org/10.17602/M2/M8531) (a) and the left M² (UF 281001:
http://dx.doi.org/10.17602/M2/M8550) (c) of P. transitus compared with M¹ and M²
of N. fieldsi (b, d): right M¹ (IGM 89019: http://dx.doi.org/10.17602/M2/M8544)
(b), and right M² (IGM-KU 89104: http://dx.doi.org/10.17602/M2/M8548) (d);
and the left M¹ and M² of Cebus capucinus (USNM 291128:
http://dx.doi.org/10.17602/M2/M8627) in occlusal (e), lingual
(f) and buccal views (g). Images were generated from microCT scan data
(see Methods and links associated with specimen numbers). Right-sided
teeth were flipped to facilitate comparison with left-sided teeth. Scale
bar, 1 mm.
Extended Data Figure 6 | Detailed comparisons of Panamacebus
lower teeth to those of Neosaimiri and Stirtonia. a-k, Left to right,
occlusal, distal, lingual, mesial and buccal views of the partial left I;
(UF 280130: http://dx.doi.org/10.17602/M2/M8400) (a), right I, (UF
267048, mirrored: http://dx.doi.org/10.17602/M2/M8395) (d), left P2
(UF 280127: http://dx.doi.org/10.17602/M2/M8397) (g), left P4
(UF 280129: http://dx.doi.org/10.17602/M2/M8399) (i) and right
C, (UF 280131, mirrored: http://dx.doi.org/10.17602/M2/M8401)
(k) of P. transitus compared with the right I, (IGM-KU 89086:
http://dx.doi.org/10.17602/M2/M8546) (b), right I, (IGM-KU 89092:
http://dx.doi.org/10.17602/M2/M8547) (e), left I, (UCMP 39205:
http://dx.doi.org/10.17602/M2/M1880) (f), right P2 (IGM-KU 90016:
http://dx.doi.org/10.17602/M2/M8549) (h) and right C; (IGM-KU 89021:
http://dx.doi.org/10.17602/M2/M8432) (j) of N. fieldsi and the right
I, (UCMP 38989: http://dx.doi.org/10.17602/M2/M1799) of Stirtonia
tatacoensis (c). Images were generated from microCT scan data
(see Methods and links associated with specimen numbers). Right-sided
teeth were flipped to facilitate comparison with left-sided teeth. Scale
bar, 5 mm.
Extended Data Figure 7 | Detailed comparisons of Panamacebus lower
teeth to those of Cebus. a-i, Occlusal (a), lingual (c) and buccal (e) views
of P. transitus partial left I, (UF 280130: http://dx.doi.org/10.17602/M2/M8400),
right I, (UF 267048, mirrored: http://dx.doi.org/10.17602/M2/M8395),
right C, (UF 280131, mirrored: http://dx.doi.org/10.17602/M2/M8401),
left P2 (UF 280127: http://dx.doi.org/10.17602/M2/M8397) and
left P4 (UF 280129: http://dx.doi.org/10.17602/M2/M8399); occlusal (b),
lingual (d) and buccal (f) views of C. capucinus right P2-P4 (USNM
291128: http://dx.doi.org/10.17602/M2/M8629); and occlusal (g), lingual
(h) and buccal (i) views of C. capucinus left dentary with I1-M3 (USNM
291236: http://dx.doi.org/10.17602/M2/M8622). Right-sided teeth
were mirrored to facilitate comparison with left-sided teeth. Images
were generated from microCT scan data (see Methods). Scale bars,
5 mm.
Extended Data Figure 8 | Detailed palaeogeographic reconstruction
of the Panama Canal Basin region during the late Oligocene-early
Miocene, showing the location of key geological formations, faults
and tectonic blocks. Sedimentary environments were extrapolated from
published stratigraphic sections that were placed over the palinspastic
reconstruction in the following locations: (1) Pacific sections that
include conglomerate and sandy strata; (2) upper Magdalena Basin;
(3) Amaga Formation coal-bearing and sandy-conglomeratic; (4) Choco;
(5) easternmost Panama; (6) western Panama; (7) Panama; (8) Canal basin;
(9) northwestern Colombia; (10) Sierra Nevada Santa Marta, unnamed
sandy and conglomeratic strata; (11) Guajira Peninsula; (12) Falcon;
(13) Falcon/Lara; (14) middle Magdalena Basin; (15) Floresta Massif;
(16) axial Cordillera Oriental; (17) foothills; (18) southern middle
Magdalena Basin. See Supplementary Methods for references and detailed
discussion.


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


[Tree figure: tip labels are platyrrhine species of Pithecia, Cacajao, Chiropotes, Callicebus, Alouatta, Brachyteles, Lagothrix, Ateles, Saimiri, Cebus, Aotus, Saguinus, Leontopithecus, Callimico and Callithrix; node labels give posterior probabilities and 95% highest posterior density intervals.]


Extended Data Figure 9 | Maximum clade credibility tree summarizing the concatenated trees from the divergence dating analysis using the birth- 
death model showing the Platyrrhini. Ninety-five per cent highest posterior density intervals are shown on key nodes and the corresponding branches 
are labelled with posterior probability values. For clarity, only the Platyrrhini are shown here. 
Extended Data Figure 10 | Maximum clade credibility tree summarizing the concatenated trees from the divergence dating analysis using the Yule
model showing the Platyrrhini. Ninety-five per cent highest posterior density intervals are shown on key nodes and the corresponding branches are 
labelled with posterior probability values. For clarity, only the Platyrrhini are shown here. 
Restoring cortical control of functional movement
in a human with quadriplegia 


Chad E. Bouton, Ammar Shaikhouni, Nicholas V. Annetta, Marcia A. Bockbrader, David A. Friedenberg,
Dylan M. Nielson, Gaurav Sharma, Per B. Sederberg, Bradley C. Glenn, W. Jerry Mysiw, Austin G. Morgan,
Milind Deogaonkar & Ali R. Rezai


Millions of people worldwide suffer from diseases that lead to 
paralysis through disruption of signal pathways between the brain 
and the muscles. Neuroprosthetic devices are designed to restore 
lost function and could be used to form an electronic ‘neural bypass’ 
to circumvent disconnected pathways in the nervous system. It has 
previously been shown that intracortically recorded signals can be 
decoded to extract information related to motion, allowing non- 
human primates and paralysed humans to control computers and 
robotic arms through imagined movements¹⁻¹¹. In non-human
primates, these types of signal have also been used to drive activation
of chemically paralysed arm muscles¹²,¹³. Here we show that
intracortically recorded signals can be linked in real-time to muscle 
activation to restore movement in a paralysed human. We used a 
chronically implanted intracortical microelectrode array to record 
multiunit activity from the motor cortex in a study participant with 
quadriplegia from cervical spinal cord injury. We applied machine- 
learning algorithms to decode the neuronal activity and control 
activation of the participant’s forearm muscles through a custom- 
built high-resolution neuromuscular electrical stimulation system. 
The system provided isolated finger movements and the participant 
achieved continuous cortical control of six different wrist and hand 
motions. Furthermore, he was able to use the system to complete 
functional tasks relevant to daily living. Clinical assessment showed 
that, when using the system, his motor impairment improved from 
the fifth to the sixth cervical (C5-C6) to the seventh cervical to first 
thoracic (C7-T1) level unilaterally, conferring on him the critical 
abilities to grasp, manipulate, and release objects. This is the first 
demonstration to our knowledge of successful control of muscle 
activation using intracortically recorded signals in a paralysed 
human. These results have significant implications in advancing 
neuroprosthetic technology for people worldwide living with the 
effects of paralysis. 

The study participant was a 24-year-old male with stable, 
non-spastic C5/C6 quadriplegia from cervical spinal cord injury 
(SCI) sustained in a diving accident 4 years previously. He under- 
went implantation of a Utah microelectrode array (Blackrock 
Microsystems) in his left primary motor cortex. As shown in Fig. 1a, 
the hand area of the primary motor cortex was identified preopera- 
tively by performing functional magnetic resonance imaging (fMRI) 
while the participant attempted to mirror videos of hand movements. 
The final array implantation location was chosen during surgery, 
targeting the hand area while avoiding sulci and injury to large cor- 
tical vessels. The implant location was confirmed by co-registration 
of postoperative computed tomography imaging with preoperative 
fMRI (Fig. 1a) and is consistent with the ‘knob’ region of the primary 
motor cortex¹⁴.


The participant attended up to three sessions weekly for 15 months 
after implantation to use the neural bypass system (NBS). In each 
session, he was trained to utilize his motor cortical neuronal activity to 
control a custom-built high-resolution neuromuscular electrical stim- 
ulator (NMES). The NMES delivered electrical stimulation to his para- 
lysed right forearm muscles using an array of 130 electrodes embedded 
in a custom-made flexible sleeve wrapped around the arm (Fig. 1b). 
The participant was positioned in front of a computer monitor, and a 
stereo camera was placed overhead to track and record hand move- 
ments (Fig. 1c). During the study, up to 50 single units could be isolated 
in a given session. Near the end of the study, 33 units could be isolated 
with a mean signal-to-noise ratio of 3.05 ± 0.81 (mean ± s.d.) including
units that responded to imagined or performed wrist movements 
(Fig. 1d). (See Extended Data Fig. 1 for additional unit activity.) Wavelet 
decomposition of the multiunit activity recorded from 96 microelec- 
trodes was used to produce mean wavelet power (MWP) features for 
decoding (Fig. 1e) (see Methods).

To assess the ability of the NBS to restore individual movements, we 
focused on six wrist and hand movements that were all impaired by the 
participant’s injury and reactivated by stimulation of forearm muscles 
(see Supplementary Video 1 showing the participant attempting the 
six movements without the use of the NBS). Each session began with 
recalibration of the NMES to map electrode stimulation patterns to 
evoked movements (see Methods). Cortical activity was continuously 
decoded as the participant attempted the six selected movements inter- 
leaved with rest periods, as cued by an animated virtual hand on the 
computer monitor. Changes in the MWP patterns for each movement 
were captured during the test. These patterns were then processed by 
multiple simultaneous neural decoders, one for each trained move- 
ment, using a nonlinear kernel method with a non-smooth support 
vector machine¹⁵. The decoders were trained in successive blocks and,
once trained, their outputs were continuously compared using the 
highest decoder output to control the corresponding NMES movement 
stimulation pattern (see Methods). During movement, a large portion 
of the stimulation artefact that occurred during a stimulation pulse was 
removed, but stimulation effects still remained (see Methods). 
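
The comparison of decoder outputs can be sketched as a simple winner-take-all rule. This minimal Python illustration assumes one scalar score per decoder at each time step and, following the Figure 2 legend, that stimulation is driven only when the highest score exceeds zero. The function name `select_movement` and the scores are hypothetical, and the sketch does not reproduce the decoders themselves.

```python
def select_movement(decoder_outputs):
    """Return the movement with the highest decoder score if that score is
    above zero; otherwise return None (no stimulation)."""
    movement, score = max(decoder_outputs.items(), key=lambda kv: kv[1])
    return movement if score > 0 else None

# Hypothetical scores for one time step, keyed by the six trained movements.
scores = {
    "thumb extension": -0.8,
    "wrist flexion": 0.4,
    "wrist extension": 1.2,
    "middle flexion": -1.1,
    "thumb flexion": -0.3,
    "hand open": 0.1,
}
active = select_movement(scores)
rest = select_movement({name: -1.0 for name in scores})
print(active, rest)
```

Running all decoders in parallel and picking the largest positive score means that rest is the default state whenever no decoder is confident.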

To test the system's performance, test blocks were performed con- 
sisting of five trials of each of the six trained movements presented 
in random order. At the beginning of each trial, the participant was 
visually cued by the virtual hand demonstrating a target movement. 
Representative data, including modulation of MWP (before and after 
stimulation begins), decoder outputs, and corresponding movement 
state are shown in Fig. 2. MWP increases by a factor of 2-8 after stim- 
ulation begins because of residual stimulation artefact (see Methods 
and Extended Data Fig. 2). However, since the neural decoders were 
trained with MWP from before and during stimulation, they were able 
Figure 1 | Experimental setup and neural modulation. a, Red regions
are brain areas active during attempts to mimic hand movements, where
the t-values for the move-rest T1-weighted fMRI contrast are greater
than 7; the implanted microelectrode array location from post-operation
computed tomography is shown in green; the overlap of the red and green
regions is shown in yellow. b, Neuromuscular electrical stimulation sleeve.
c, NBS in use with the participant in front of a computer monitor.
d, Examples of rasters and peristimulus histograms from one discriminated 
unit (channel 37 unit 1) with response to attempted wrist movements. The 
participant was presented with cues to attempt wrist flexion (WF), wrist
extension (WE), wrist radial deviation (WR), and wrist ulnar deviation
(WU). Each cue was presented for a random duration of 3-4 s followed
by a random 3-4 s rest period. We presented 20 trials of each in random
order. The top part of each subpanel is a raster, each dot represents a 
spike, and each row of spikes represents data from one trial. All trials were 
aligned on cue presentation (time zero, red dashed line). At the right of 
the panel is an example of 500 randomly selected waveforms from the
discriminated unit (blue) from channel 37 that responded to wrist, elbow, 
and shoulder movements. The thick yellow line represents the average 
waveform for this unit which showed increased firing rate for imagined 
wrist flexion and wrist ulnar deviation (as well as elbow and shoulder 
movements, see Extended Data Fig. 1). Because the participant was
asked to imagine and hold the movement throughout the cue period, the
increase of activity during the rest cue is probably due to the participant
imagining the antagonist movement to return to a neutral position.
e, Wavelet-processed neural data from channel 37 corresponding to the
same movements in d. The bottom part of each subpanel shows the mean
wavelet coefficients for each movement and each trial over time. Each row
represents data from a trial. The top part shows the MWP averaged over
the trials in dark red with a confidence interval of 1 s.d. around the mean
shown in light pink. The trials were centred on cue presentation (black 
dashed line). (Photographs by A. Morgan and T. R. Massey.) 
Figure 2 | MWP and system performance for individual hand
movements. a-f, Heat maps of MWP representing processed signals from
the microelectrode array, neural decoder output scores (dashed line),
and physical hand movements (solid line) as detected by a computer-based
video classification algorithm for each movement type: thumb extension 
(a), wrist flexion (b), wrist extension (c), middle flexion (d), thumb flexion 
(e), and hand open (f). Of the six simultaneously running decoders, the 
output score from the one with the highest amplitude greater than zero 
was used to turn on/off the stimulation. The vertical white line marks 
when the decoder crosses zero and the stimulation is turned on. MWP 
increases (by a factor of 2-8) during stimulation because of residual 
stimulation artefact (see Methods). The MWP plot is split into two areas
with two different colour scales to facilitate visualization of MWP before
and during stimulation. Trial data shown range from 0.5 s before the cue
to 2.5 s after the cue ended. A 1-s-wide boxcar filter is applied to the MWP
data, causing a delay of ~0.5 s. Each subfigure represents one trial out of
the total five trials for each individual hand movement and the time scale 
on the x axis shows the absolute time when the particular movement 
occurred during the test block. The MWP is in units of standard deviations 
away from a baseline non-movement period. Only decoder outputs greater 
than zero are shown for visual clarity. 
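
The delay introduced by the boxcar filter follows from its width: a causal moving average over N samples lags its input by (N - 1)/2 samples. The sketch below is a minimal Python illustration; the 10 Hz feature rate is an assumption made for the example and is not stated in this section, and `boxcar` is a hypothetical helper.

```python
def boxcar(signal, width):
    """Causal moving average over the most recent `width` samples."""
    out = []
    running = 0.0
    for i, x in enumerate(signal):
        running += x
        if i >= width:
            running -= signal[i - width]  # drop the sample leaving the window
        out.append(running / min(i + 1, width))
    return out

sample_rate_hz = 10          # assumed feature rate, for illustration only
width = 1 * sample_rate_hz   # 1-s-wide window expressed in samples
delay_s = (width - 1) / 2 / sample_rate_hz  # group delay of a causal boxcar

# A unit step at sample 20 reaches half height about half a window later.
step = [0.0] * 20 + [1.0] * 20
smoothed = boxcar(step, width)
print(delay_s, smoothed[24], smoothed[29])
```

Under this assumed rate the lag is 0.45 s, in line with the roughly 0.5 s delay noted in the legend.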
Figure 3 | GRASSP clinical assessment. GRASSP performance on each
domain was scored by percentage of possible points earned, and normed to
benchmarks of the International Standards for Neurological Classification
of Spinal Cord Injury and the American Spinal Injury Association
Impairment Scale (greyscale regions in graph). The participant’s baseline
score (red), obtained 8 months after implantation, falls within the C5-C6
norms for strength on MMT and quantitative prehension (prehensile skills)
and between C7-C8 for qualitative prehension (gross grasping ability).
Using the NBS (green), between 8 and 9 months after implantation, his
motor function improved in all areas tested, with strength nearly achieving
C8-T1 level function, qualitative prehension similarly improving to
C8-T1 level, and quantitative prehension improving to C6 level.
Qualitative prehension was limited by lack of stimulation electrodes
to facilitate thumb opposition. MMT strength was limited by lack of
stimulation electrodes over elbow extensors and hand intrinsics.


to recognize the correct imagined movement to initiate stimulation 
and the participant’s desire to sustain and subsequently terminate the 
target movement (see Supplementary Video 2 showing the participant 
attempting the six movements using NBS). Accuracy (percentage of 
video frames where observed movement matches the cue), sensitiv- 
ity (percentage of video frames during cued movement where correct 
movement was observed), and specificity (percentage of video frames 
correctly identified during rest or non-target movement cues) for the 
trained movements were measured by computer-based evaluation of 
video frames of hand movements (see Methods and Extended Data 
Fig. 3a). Using the system, the participant achieved an overall accuracy
of 70.4 ± 1.0% (mean ± s.d., P < 0.01 by permutation test¹⁶).
His accuracy for performing individual movements ranged from 


Figure 4 | Grasp-pour-and-stir functional movement task.
a-f, Sequential snapshots from the functional movement task showing the
participant opening his hand (a), grasping the glass bottle (b), pouring
its contents (dice) into a jar (c), grasping a stir stick from another jar (d),
transferring the stir stick without dropping it (e), and using it to stir the
93.1 ± 0.5% for wrist flexion (P < 0.01) to 97.3 ± 0.3% for thumb
flexion (P < 0.01). Sensitivity ranged between 32.9 ± 3.8% for thumb
extension and 81.9 ± 2.6% for wrist extension, and specificity ranged
between 94.8 ± 0.5% for wrist flexion and 99.8 ± 1.0% for thumb flexion
(Extended Data Fig. 3b). The participant was able to perform similar
basic movement tasks with text or verbal cues as well, suggesting
the mirror neuron system is not required for these tasks. 

To characterize the recovery of upper limb function when using 
the NBS, a physiatrist performed assessments of upper limb senso- 
rimotor function using the Graded and Redefined Assessment of 
Strength, Sensibility, and Prehension (GRASSP) test¹⁷. The GRASSP
is a standardized test with excellent inter-rater and test-retest reliability 
developed to assess sensorimotor impairment of patients with cervical 
SCI across the following domains of the Manual Muscle Test (MMT): 
strength, dorsal sensation, ventral sensation, gross grasping ability 
(qualitative prehension), and prehensile skills (quantitative prehen- 
sion). All five GRASSP domains were assessed without use of NBS to 
generate a baseline score. As only motor, and not sensory, function was 
expected to change with use of the NBS, MMT, gross grasping ability, 
and prehensile skills subtests were repeated using the system. The NMES 
was calibrated to map electrode stimulation patterns to desired evoked 
movement patterns using an anatomically guided trial-and-error 
approach and a new decoder was trained for each individual task, rather 
than one for all tasks (see Methods). Figure 3 shows that when the participant
used the NBS, his MMT strength improved from C6 to C7-C8
level, his gross grasping ability improved from C7-C8 to C8-T1 level,
and his prehensile skills improved from C5 to C6 level. These findings
quantify the reduction in functional motor impairments possible with 
NBS for patients with cervical SCI. Fatigue was sometimes observed 
after multiple repetitions of sustained (3-4 s) stimulation.

To investigate the effectiveness of the NBS in assisting with activities
relevant to daily life, we asked the participant to perform a
complex functional movement that he was unable to complete without 
the system. The task required him to grasp a bottle with a cylindrical 
grasp, maintain the grasp while pouring its contents into a jar, open 
his hand to release the bottle, and use a precision pinch to pick up and 
hold a stir stick to stir the jar’s contents (Fig. 4 and Supplementary 
Video 3). During the task, the participant used his residual shoulder 
and elbow movements to guide his hand in space while he performed 
the task; however, this presented a significant challenge in initial test- 
ing. The participant had difficulty in maintaining his grasp of the 
objects, particularly during transfer, and often dropped them. We 
observed differences in the MWP associated with imagined grasping, 


dice in the jar (f). The task required the participant to evoke different 
cortical modulation patterns to control the opening of his hand, perform 
a cylindrical palmar grasp, and achieve a precision pinch grasp, while 
simultaneously moving his arm. (Video by C. Majstorovic.) 
stimulation-induced grasping, and grasp-and-transfer movements
(Extended Data Fig. 4). To improve the decoding performance during 
these complex tasks, the MWP data from all three of these periods 
were used to train a nonlinear support vector machine (see Methods). 
Using this approach, the participant was able to successfully complete 
the grasp-pour-and-stir task three out of five times in 10 min with a 
completion time of 42 ± 10 s (mean ± s.d.) for successful trials, demonstrating
that he was able to utilize the NBS for a functional activity that
he could not perform otherwise (see Supplementary Video 4 showing 
the participant attempting the task without the use of NBS). 

In this study, for the first time, a human with quadriplegia regained 
volitional, functional movement through the use of intracortically 
recorded signals linked to neuromuscular stimulation in real time.
With use of the investigational system, our C5/C6 participant gained 
wrist and hand function consistent with a C7-T1 level of injury. This 
improvement in function is meaningful for reducing the burden of care 
in patients with SCI as most C5 and C6 patients require assistance for 
activities of daily living, while C7-T1 level patients can live more 
independently. Although invasive, the NBS provides an advantage 
over existing functional electrical stimulation systems, such as the Freehand
system, that utilize low-dimensional control signals such as EEG or EMG¹⁸⁻²⁰.
These devices typically allow control over fewer movements than those
demonstrated in this study, because of the relatively
low information content of their control signal sources compared with 
intracortically recorded signals. 

Complex functional movements that involve both stimulation-induced 
and residual movements, such as those at the shoulder and elbow, pre- 
sented an important challenge in this study. Residual limb movement 
can activate motor cortical neurons²¹ and some motor cortical neurons
respond to movement from several joints²². This may explain the neural
activity differences observed and the difficulties initially encountered 
as the patient attempted movements involving paralysed and non- 
paralysed joint movement simultaneously. The nonlinear algorithm 
presented here is one possible solution to this problem. To allow transfer 
of this work to other patients, further improvements will be required 
on the microelectrode technology, algorithms, and NMES. However, 
the electronic neural bypass presented here demonstrates what is possible 
in the future and can offer hope for movement restoration to people 
living with paralysis worldwide. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS


No statistical methods were used to predetermine sample size. Cues within an 
experiment were randomized, but experiments themselves were not randomized. 
The investigators were not blinded to allocation during experiments and outcome 
assessment. 

Approval for this study was obtained from the US Food and Drug Administration
(Investigational Device Exemption) and The Ohio State University Medical Center
Institutional Review Board (Columbus, Ohio). The study met institutional
requirements for the conduct of human subjects research and was registered on the
http://www.ClinicalTrials.gov website (identifier NCT01997125). The participant referenced
in this work provided permission for photographs and videos and completed an 
informed consent process before commencement of the study. 
Study participant and surgical procedure. The study participant had complete, 
non-spastic quadriplegia subsequent to traumatic cervical spine injury sustained 
4 years previously while diving. He had full bilateral elbow flexion (grade 5/5), 
active wrist extension with radial deviation through an incomplete range of motion 
against gravity (grade 2/5), but no motor function below the level of C6. His sen- 
sory level was C5 on the right (because of altered but present light touch on his 
thumb) and C6 on the left. He had intact proprioception in the right upper limb 
at the shoulder for internal rotation through external rotation, at the elbow for 
flexion through extension, at the forearm for pronation through supination, and 
at the wrist for flexion through extension. Proprioception for right digit flexion 
through extension at the metacarpal-phalangeal joints was impaired for all digits. 
His injury was complete, with an overall neurological level of C5 American Spinal 
Injury Association Impairment Scale A with zone of partial preservation for motor 
function to C6 bilaterally according to the International Standards for Neurological 
Classification of Spinal Cord Injury²³. Electromyography with nerve conduction
studies was performed by a board-certified electromyographer to further phenotype
the patient’s motor impairments, documenting presence of voluntary motor
units in the right deltoid (C56), biceps (C56), and extensor carpi radialis (C67), but
absence of motor units in triceps (C678), pronator teres (C67), extensor digitorum
(C78), extensor indicis (C78), flexor pollicis longus (C78), abductor pollicis brevis
(C8-T1), and first dorsal interosseous (C8-T1) muscles.

The patient underwent a left frontoparietal craniotomy. The hand area of motor 
cortex was identified preoperatively by fusing fMRI activation maps obtained while 
the patient attempted movements co-registered to the preoperative planning MRI. 
An intraoperative navigation system was used to plan the craniotomy. The microe- 
lectrode array was then implanted into the cortex using a pneumatic inserter. The 
pedestal was tunnelled under the skin to a posterior exit point. Reference wires 
were placed subdurally according to the manufacturer’s guidelines. 
fMRI acquisition. A 3 T MRI system (Philips Achieva) equipped with an 
eight-channel head coil was used to collect ten runs of functional images with a 
blipped single-shot gradient-echo EPI imaging sequence with a spatial resolution of
2.2 mm × 2.2 mm × 2.3 mm. The acquisition parameters were repetition time/echo
time (TR/TE) 2,000/30 ms, 80° flip angle, 104 × 75 matrix size, 230 mm × 168 mm
field of view, 27 slices, and echo train of 75 ms. The number of dynamic acquisitions
was 165 and each run took 5.5 min. The field of view of the functional scans
did not cover the whole brain, but did cover the superior motor strip bilaterally.
A high-resolution T1-weighted image was also acquired with the following parameters:
TR/TE 7.7/3.5 ms, 1 mm × 1 mm × 1 mm voxel resolution, three-dimensional
acquisition with 144 slices.

We asked the participant to visualize movements of his right hand while we
collected fMRI data. We collected ten blocks, each with ten trials. Each trial consisted
of a fixation cross displayed for 1.4 s, followed by a 5 s video depicting a movement
beginning after a 0.1 s delay. Each movement video was repeated three times with
a 1.5 s interval between repetitions. The orientation cross for the next set of videos
was displayed after a jittered inter-trial interval of between 8 and 12 s. In addition
to the movement videos, each block included two sets of three repetitions of 5 s
videos depicting a right hand at rest. Each video
began with a right hand in the resting position of the participant’s right hand. The hand in the
video then executed a specific motion and held that movement before returning
to the rest position. The videos depicted the following eight movements: clenching
hand into a fist, wrist flexion, wrist pronation and supination, drumming fingers by
sequentially touching the palm with the fifth to second fingers, splaying all fingers
to maximum extension, flexing the first finger, flexing the second finger, and flexing
the third finger; and finally resting, in which the hand did not move.

To obtain the final results, we contrasted all of the movement conditions against 
the rest condition and found that the areas with the highest activation were in the 
left motor and sensory cortex. The location of the microelectrode array implanta- 
tion overlapped with an area in the motor strip where activity in response to motor 
visualization was observed. 

Functional MRI processing. Functional MRI processing was performed using AFNI²⁴
and FSL²⁵. We dropped the first three volumes of each run, then removed spikes
with 3ddespike and regressed out cardiac and respiratory signals with 3dretroicor. 
Next, we temporally shifted all of the slices in each volume to the start of the
volume’s collection using 3dTshift with Fourier interpolation and registered all
of the functional volumes to the third volume of the first run. We fitted a general
linear model by restricted maximum likelihood, with regressors for each of the
visualization movement types and regressors for motion and the derivative of motion.
The regressor for each visualization movement was a 5s block convolved with a 
haemodynamic response function starting at the onset of each video presentation. 
Regressions were fitted with 3dDeconvolve and 3dREMLfit. To identify activity 
related to motor visualization, we contrasted all of the movements versus the rest 
condition and applied a threshold of a t value of 7, then dilated these results so that 
they were visible on the cortical surface. We co-registered the functional images 
with the T1-weighted anatomical image with FSL’s grey-matter/white-matter
boundary-based linear registration. Postoperative computed tomography images 
were linearly registered with the same anatomical image using the FSL image 
registration tool with a mutual information cost function and 6 degrees of freedom. 
The overlap between microelectrode array location and fMRI results was visualized 
with MRIcroGL (http://www.cabiatl.com/mricrogl/). Researchers were not blinded 
during pre-processing or subsequent analyses. 

System architecture. The Neuroport neural data acquisition system and the Utah 
microelectrode array with 1.5 mm electrodes (Blackrock Microsystems) were used 
to acquire the neural data. A 0.3 Hz first-order high-pass and a 7.5 kHz third-order 
low-pass Butterworth analogue hardware filter were applied to the data, and each 
of the 96 channels of the microelectrode array was sampled at a rate of 30,000
samples per second. The digitized data were then transmitted to a personal com- 
puter where they were decoded to determine which motion was being imagined 
and then encoded to evoke the desired response from the muscles in the forearm. 
The computer communicated with the custom high-definition NMES that drove 
the electrode sleeve wrapped around the forearm. 

Signal processing. The signal processing and decoding/control algorithms were 
all run on a personal computer using MATLAB (version 2012a). The digitized 
data from the Neuroport were processed every 100 ms. Stimulation artefact in the
data was detected by looking for a threshold crossing at 500 µV that occurred
simultaneously on at least 4 of 12 randomly selected channels. A 2.5 ms window of
data around each detected artefact was then removed and adjacent data segments 
were rejoined. This approach removed a large portion of the artefact, but stimu- 
lation effects still remained. Residual artefact, caused by the Neuroport amplifiers 
recovering from a brief period of saturation after each stimulation pulse, led to 
increased signal amplitude during stimulation (see wavelet processing description 
below). The decoders were trained with data from the period before and during 
stimulation to accommodate this residual artefact and allow the user to initiate 
and sustain the correct movement. 
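
The artefact-rejection step described above can be sketched roughly as follows. This is an illustrative Python sketch, not the authors' MATLAB code: the function names, the use of the absolute voltage for the threshold test, and the centring of the 2.5 ms window on each detected sample are assumptions; only the 500 µV threshold, the 4-of-12 channel rule, the 2.5 ms window, and the 30,000 samples per second rate come from the text.

```python
import random

def detect_artefact_samples(data, threshold_uv=500.0, n_check=12,
                            n_required=4, seed=0):
    """Return sample indices flagged as stimulation artefact.

    data: list of channels, each a list of voltages in microvolts.
    A sample is flagged when at least n_required of n_check randomly
    chosen channels cross the threshold at the same time.
    """
    rng = random.Random(seed)
    checked = rng.sample(range(len(data)), min(n_check, len(data)))
    hits = []
    for t in range(len(data[0])):
        # Assumption: a "crossing" is tested on the absolute voltage.
        crossings = sum(1 for ch in checked if abs(data[ch][t]) >= threshold_uv)
        if crossings >= n_required:
            hits.append(t)
    return hits

def remove_artefact_windows(signal, hits, fs=30000, window_ms=2.5):
    """Cut a window around each flagged sample and rejoin the remainder."""
    width = int(fs * window_ms / 1000)  # 75 samples at 30 kHz
    drop = set()
    for t in hits:
        # Assumption: the window is centred on the flagged sample.
        start = max(0, t - width // 2)
        drop.update(range(start, min(len(signal), start + width)))
    return [x for i, x in enumerate(signal) if i not in drop]
```

Because the surviving segments are simply concatenated, the cleaned signal is shorter than the input, which is consistent with the later statement that wavelet decomposition is performed on the shorter, rejoined data.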

Wavelet decomposition was then applied to the data, using the ‘db4’ wavelet and
11 wavelet scales²⁶. Wavelet decomposition has been shown to be an effective tool
in neural decoding applications and provides information encompassing single
unit, multiunit, and LFP, without requiring spike sorting²⁷.

In this study four wavelet scales (3-6) were used, corresponding to the multiunit 
frequency band spanning approximately 234-3,750 Hz. The mean of the wavelet 
coefficients for each scale of each channel was calculated every 100 ms and a 1s 
wide boxcar filter was applied to smooth the data. Baseline drift in the data was 
estimated by using a 15s boxcar filter and was subtracted from the smoothed mean 
wavelet coefficients for the corresponding 100 ms window. This drift may be due 
to multiple causes including changes in concentration/focus level and has been 
observed previously²⁸. The mean coefficients of scales 3-6 were then standardized
per channel, per scale, by subtracting the mean and dividing by the standard
deviation of those scales and channels during the training blocks. The four scales 
were then combined by averaging the standardized coefficients for each channel, 
resulting in 96 values for every 100 ms of data. When present, residual artefact in 
a portion of this 100 ms segment can cause a local increase in wavelet coefficients 
leading to a twofold to eightfold increase in the reported mean for the segment. The 
resulting values were then used as features, termed mean wavelet power (MWP), 
as input into the decoders. Example decoder output scores are shown in Extended 
Data Fig. 5. 
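
As a rough check on the frequency band quoted above, and as a sketch of how the per-channel MWP feature could be assembled, consider the following Python example. It assumes a dyadic decomposition in which detail scale j spans fs/2^(j+1) to fs/2^j, trailing (causal) boxcar windows, and a drift estimate taken from the unsmoothed 100 ms means; these choices and all function names are illustrative assumptions rather than details given in the text.

```python
from statistics import mean

def scale_band_hz(scale, fs=30000.0):
    """Approximate passband (low, high) in Hz of one dyadic detail scale."""
    return fs / 2 ** (scale + 1), fs / 2 ** scale

def boxcar(values, width):
    """Trailing moving average over the last `width` bins (shorter at start)."""
    out = []
    for i in range(len(values)):
        window = values[max(0, i - width + 1): i + 1]
        out.append(sum(window) / len(window))
    return out

def mwp_for_channel(scale_means, train_stats, smooth_bins=10, drift_bins=150):
    """Combine scales into one MWP value per 100 ms bin for one channel.

    scale_means: dict scale -> list of mean wavelet coefficients per bin.
    train_stats: dict scale -> (mean, std) measured on the training blocks.
    smooth_bins=10 and drift_bins=150 correspond to 1 s and 15 s of 100 ms bins.
    """
    standardized = []
    for scale, series in scale_means.items():
        smoothed = boxcar(series, smooth_bins)   # 1 s smoothing
        drift = boxcar(series, drift_bins)       # 15 s baseline estimate
        mu, sigma = train_stats[scale]
        standardized.append([(s - d - mu) / sigma
                             for s, d in zip(smoothed, drift)])
    # Average the standardized scales bin by bin.
    return [mean(vals) for vals in zip(*standardized)]
```

Under these assumptions, scales 3 to 6 at 30,000 samples per second together span 234.375 Hz to 3,750 Hz, which agrees with the approximate 234-3,750 Hz band stated in the text.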

To look at MWP signal quality over the study period, data for all channels were 
collected over 60s at the beginning of each test session where the participant was 
instructed to close his eyes and rest. MWP features with no mean subtraction were 
calculated to approximate the power in the multiunit frequency bands. After an 
initial decline, the MWP stabilized at 150 days after implantation (Extended
Data Fig. 6). 

Stimulation. The participant received intermittent stimulation during sessions 
(with a maximum of three sessions per week, typically lasting 3-4h including setup 
time). The continuous, variable output score of each decoder was used to drive 
the stimulation level during all tasks. Disks of stimulation hydrogel material
(Axelgaard), 12 mm in diameter, were applied to the stimulation electrodes. The
centre-to-centre spacing of the electrodes was 22 mm along the long axis of the
forearm and 15 mm in the transverse direction. Electrical stimulation was provided
intermittently in the form of current-controlled, monophasic (biphasic
recommended for long-term, continuous use) rectangular pulses of a 50 Hz pulse
rate and 500 µs pulse width. Pulse amplitudes ranged from 0 to 20 mA and were
updated every 100 ms.

For each motion, the stimulation intensity and spatial pattern were determined 
by using a trial-and-error method. The output score of a given decoder was in the 
−1 to 1 range and, when it exceeded zero, the system enabled stimulation for that
movement. If the output scores of multiple decoders exceeded zero simultaneously, 
then the system enabled the movement with the highest decoder score. If ‘graded 
stimulation intensity’ was not being used, then the stimulator would use the full 
(calibrated) stimulation intensity for the desired motion. 
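
The arbitration rule in this paragraph (stimulate only for a decoder whose score exceeds zero, and pick the highest score when several do) can be illustrated with a short Python sketch. The function name and the dictionary representation of decoder outputs are hypothetical.

```python
def select_movement(scores):
    """Pick the movement to stimulate from decoder output scores.

    scores: dict mapping movement name -> decoder output in [-1, 1].
    Returns the name with the highest score if that score exceeds zero,
    otherwise None, meaning no stimulation (rest).
    """
    best = max(scores, key=scores.get, default=None)
    if best is None or scores[best] <= 0:
        return None
    return best
```

For example, `select_movement({"wrist_flexion": 0.3, "thumb_flexion": 0.7})` returns `"thumb_flexion"`, while a set of scores that are all at or below zero returns `None`.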

In some cases, graded stimulation intensity was used, which required three 
stimulation intensity levels to be captured. The stimulation intensity levels were 
set so that they produced a low, medium, and high amount of joint deflection 
(visually determined by the operator). The decoder output score, r, was then used 
to set the stimulation intensity level, f(r), during real-time decoding. A piecewise 
linear function was used to interpolate between the three calibrated stimulation 
intensities as follows: 


$$
f(r) =
\begin{cases}
0, & r \le 0 \\
r(m - l)/0.2 + l, & 0 < r \le 0.2 \\
r(h - m)/0.2 + 2m - h, & 0.2 < r \le 0.4 \\
h, & r > 0.4
\end{cases}
$$

where l, m, and h correspond to the three stimulation intensities. The thresholds of
0.2 and 0.4 were derived empirically and were used to compensate for the nonlinear 
response of the muscles to surface stimulation. 
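
The graded-intensity mapping can be written as a small function. The sketch below is in Python and follows the piecewise-linear form described above: zero for scores at or below zero, interpolation from the low to the medium intensity up to r = 0.2, from medium to high up to r = 0.4, and the high intensity beyond that. The function and parameter names are illustrative, and the treatment of the exact boundary values is an assumption (it only matters at r = 0, because the segments meet at 0.2 and 0.4).

```python
def stimulation_intensity(r, low, mid, high):
    """Map a decoder output score r to a stimulation intensity.

    low, mid and high are the three calibrated intensities (l, m, h).
    """
    if r <= 0:
        return 0.0                                   # stimulation disabled
    if r <= 0.2:
        return r * (mid - low) / 0.2 + low           # low -> mid
    if r <= 0.4:
        return r * (high - mid) / 0.2 + 2 * mid - high  # mid -> high
    return float(high)                               # saturate at high
```

With calibrated intensities of 5, 10, and 20 (arbitrary units), the function gives approximately 10 at r = 0.2, 15 at r = 0.3, and 20 for any score of 0.4 or above, so the output is continuous at both thresholds.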

Stimulation spatial patterns and intensity levels were saved in a database. In sub- 
sequent sessions with the participant, the previous calibrations could be recalled. 
See Supplementary Table 1 for electrode labels and total number of stimulation 
electrodes used to evoke each movement during the individual movement task 
and Extended Data Fig. 7 for a representative stimulation pattern used to evoke 
‘hand open’ movement.

Task methods and algorithm training. The decoders were trained in blocks, each 
consisting of repetitions of desired motions with short rest periods between. The 
participant was cued by a small graphical hand displayed on a monitor. After each 
block, the decoders were adapted (created or updated) using data from the most 
recent three blocks. During the first block (before a decoder had been built), the 
system provided ‘scripted’ stimulation (stimulation corresponding to the cue) while 
the participant imagined/attempted the cued movements. After the first block, a 
decoder was built and the study participant received real-time, cortically controlled 
stimulation. Every 100 ms, the system updated the stimulation with the electrode 
pattern for the motion with the highest decoder output score. For a short amount 
of time during each cue, referred to as ‘no-feedback time’, no stimulation was given,
which provided training data from both the period before motion and during 
stimulation/motion. For tasks that included residual shoulder and elbow move- 
ment, longer movement cues were used to also capture training data for the grasp 
alone and subsequently the grasp with arm movement in space. These training data
(cues and MWP features) were used to train a nonlinear support vector machine 
algorithm, resulting in more robust decoders. A decoder for each motion (against 
all other motions/rest) was built using a nonlinear Gaussian radial basis function 
kernel²⁹ to process the features. The processed features were then input into a
non-smooth support vector machine algorithm that used sparsity optimization
to improve decoder accuracy by zeroing out the least valuable MWP features³⁰.
During decoding, all decoders ran simultaneously and the decoder with the highest 
output score above zero was used to drive the stimulator. 
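
For readers unfamiliar with the kernel mentioned above, a Gaussian radial basis function compares two feature vectors through their squared Euclidean distance. The Python sketch below shows only that kernel evaluation; it does not reproduce the non-smooth, sparsity-optimized support vector machine used in the study. The width parameter `gamma`, its default value, and the idea of evaluating the kernel against a set of reference vectors are assumptions made for illustration.

```python
import math

def rbf_kernel(x, y, gamma=0.1):
    """Gaussian RBF kernel: exp(-gamma * ||x - y||^2)."""
    squared_distance = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * squared_distance)

def kernel_features(x, references, gamma=0.1):
    """Evaluate the kernel between one MWP vector and each reference vector."""
    return [rbf_kernel(x, ref, gamma) for ref in references]
```

The kernel equals 1 for identical vectors and decays towards 0 as they move apart, which is what lets a linear decision rule in the kernel space act as a nonlinear decoder on the original 96 MWP features.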

Individual movement task. During the individual movement task training, the 
participant was cued by the graphical hand for all six wrist and finger motions. The 
decoders were trained in seven successive blocks, each consisting of three trials 
of each movement. Each cue lasted 2.5s and was always followed by a rest period 
of random length (2.5-4.0s); the movement cues were randomly shuffled. The 
‘no-feedback time’ was set to 1.2 s in the first block and 0.8 s in subsequent blocks.
The test portion of the ‘individual movement task’ used the same cueing param- 
eters as during training, except each movement cue was repeated five times and 
the ‘no-feedback time’ was disabled. For this task, graded stimulation intensity was 
not used. Overall accuracy for the neural decoder was calculated by the fraction of 
100 ms segments where the neural decoder agreed with the cue. 


To quantify finger and hand movements in the individual movement task, 
coloured finger cots (coverings) were placed on the participant's fingers. For detect- 
ing movements involving the wrist, an additional cot was placed on a plastic cylin- 
der extending out past the participant’s thumb. A Bumblebee2 stereo camera was 
positioned above the participant's hand to track movement in three dimensions. 
The colour of the cots was used to identify individual fingers and locate them in 
three-dimensional space using a combination of custom code and OpenCV³¹. The
location of the fingers was then fed into the same decoding algorithm used for 
neural decoding to determine the position of the hand, termed the video classifier. 
This classifier was trained using video data collected at 12 frames per second using 
scripted stimulation for the moves of interest. As previously described, accuracy, 
sensitivity, and specificity were computed. A permutation test was performed by 
comparing the observed data to data with shuffled labels¹⁶. To account for reaction
and system lag time, the video decoder data were shifted by 1.2s. 
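
The frame-based performance measures and the label-shuffling test can be sketched in Python as follows. Frames are represented as lists of labels (the cue and the classified hand position). The function names, the number of permutations, and the shuffling of individual frame labels are assumptions; the text does not specify how labels were shuffled, and shuffling single frames ignores the temporal correlation between neighbouring frames.

```python
import random

def binary_metrics(cue, observed, target):
    """Accuracy, sensitivity and specificity for one movement versus the rest."""
    tp = fn = tn = fp = 0
    for c, o in zip(cue, observed):
        if c == target:
            if o == target:
                tp += 1   # cued movement, correct movement observed
            else:
                fn += 1
        else:
            if o == target:
                fp += 1   # non-target cue, target movement observed
            else:
                tn += 1
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    return accuracy, sensitivity, specificity

def overall_accuracy(cue, observed):
    """Fraction of frames where the observed position matches the cue."""
    return sum(c == o for c, o in zip(cue, observed)) / len(cue)

def permutation_p_value(cue, observed, n_perm=1000, seed=0):
    """Share of label shufflings that score at least as well as the real data."""
    rng = random.Random(seed)
    actual = overall_accuracy(cue, observed)
    shuffled = list(cue)
    at_least = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if overall_accuracy(shuffled, observed) >= actual:
            at_least += 1
    return (at_least + 1) / (n_perm + 1)
```

With ten frames in which four are cued as a movement and three of those are classified correctly, with no false detections, the sketch gives an accuracy of 0.9, a sensitivity of 0.75, and a specificity of 1.0. The accuracy equals the class-frequency-weighted average 0.4 × 0.75 + 0.6 × 1.0, matching the relationship described for Extended Data Fig. 3.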

GRASSP assessment. Independent decoders were trained separately for each 
GRASSP test that required stimulation. During the GRASSP training, the par- 
ticipant was cued by the graphical hand for the motion needed for that particular 
task. The duration of the cue and rest periods were randomly selected from a 
uniform distribution across the ranges specified in Supplementary Table 2, which 
also specified the number of repetitions of each cue. Each cue was always followed 
by a rest period. The ‘no-feedback time’ was set to 0.8 s for all blocks. The test 
portion of the task used the same cueing parameters as those used during training, 
except each movement cue was repeated five times and the ‘no-feedback time’
was disabled. The GRASSP tests were conducted without any cues. Since decoder 
sets were trained for each individual GRASSP test (versus training one decoder 
set for all tests), the user could not switch between GRASSP tests at will without 
reloading or training the appropriate decoders. Some GRASSP tests required two 
movements to be trained for one test to open the hand around the object and then 
grip the object. In some GRASSP tests, graded stimulation intensity was used to 
make the gripping action smoother. 

Functional movement task. While performing training for the functional move- 
ment task, the participant was cued by the graphical hand for all three of the 
motions requiring stimulation. Each cue lasted 5.5 s and was always followed by a 
5s rest period. Each movement cue was repeated four times. The movement cues 
were sequenced so that the participant could perform the complete functional 
movement task during training. The ‘no-feedback time’ was set to 1.2s in the first 
block and 0.8 s in subsequent blocks. For the hand open and bottle grip movements, 
the stimulation intensity was not graded; however, graded stimulation intensity 
was used for the stir bar grip to enable a smooth grasp. 

The test portion of the functional movement task was conducted without any 
cues. The participant was verbally instructed when to start, at which point a timer
was started. For a trial to be considered successful, the participant had to grip the
bottle, pour at least ten dice into the jar, set the bottle down, pick up a stir stick with 
a pinch grasp, stir the contents with at least two (circular) stirs, and leave the stir 
stick in the jar (at which time the timer was stopped). No objects could be dropped 
or knocked over during the task and the trial had to be completed within 60s. He 
was given a total of 10 min to complete as many trials as possible (with a mandatory 
60s rest period between each trial). Sixty seconds of rest was chosen to give the 
researcher time to reset the props of the task and to give the participant time to rest 
physically and mentally. Because of his injury, the participant had limited ability 
to lift his arm, and repetitive tasks such as this requiring him to do so could cause 
his shoulder to fatigue and become uncomfortable. 
Extended Data Figure 1 | Neural modulation. a, b, Examples of rasters
and peristimulus histograms from simultaneously recorded units
with response to attempted or performed wrist, elbow, and shoulder
movements are shown in (a) for channel 68, unit 1 and in (b) for channel
37, unit 1. The participant was presented with cues to attempt wrist flexion
(WF), wrist extension (WE), wrist radial deviation (WR), wrist ulnar
deviation (WU), elbow extension (EE), or to perform elbow flexion (EF), 
shoulder flexion (SF), or shoulder abduction (SA). Each cue was presented 
for a duration of 3s with a random jitter of 0-2 s followed by a 3s period 
with a jitter of 0-2 s. We presented 20 trials of each in random order. The 
top part of each subpanel is a raster, the black dots represent spikes, each 
row of spikes represents data from one trial. All trials were aligned on cue
presentation (time zero, red dashed line). On the right of each set of panels
is an example of 500 randomly selected waveforms from the discriminated 
unit (blue). The thick yellow line represents the average waveform for
the unit. The top panel shows unit 1 on channel 68. This unit responded
well to movement around the shoulder and elbow. The bottom panel 
shows activity from unit 1 on channel 37 that responded to wrist, elbow, 
and shoulder movements. Because the participant was asked to imagine 
and hold the movement throughout the cue period, the latent increase of 
activity after the cue ended (during the rest period) was probably due to 
the participant imagining the antagonist movement to return to a neutral 
position. 
Extended Data Figure 2 | Stimulation artefact removal. Stimulation
artefact was removed from the signal associated with each electrode before
further processing (see Methods). a, Example of the voltage data during
stimulation for a single electrode shows a period before stimulation and
a period during stimulation containing three stimulation artefacts (large
amplitude pulses). b, Large artefacts were detected and removed to reduce 
their effects; however, residual artefacts could remain (causing a twofold 
to eightfold increase in MWP during stimulation). c, The period before 
stimulation is shown for reference. d, The period with three stimulation 
artefacts (after removal) is shown after concatenation/rejoining of the 
signal segments, and wavelet decomposition is then performed on these 
shorter, concatenated data (see Methods). 
Extended Data Figure 3 | Individual movement task performance. The
participant was visually cued to attempt each of the six trained movements.
a, Snapshot of each movement. b, The performance (sensitivity, specificity,
and accuracy) was measured by automatic evaluation of video frames of
hand movements for each of the six moves. The overall accuracy of the
NBS was 70.4 ± 1.0% (P < 0.01). Statistics for individual movements are
calculated against a movement-specific cue vector with two classes: one
class for the movement of interest and one class for all other moves plus
rest, which we refer to as the non-target class. Sensitivity is the percentage
of video frames during cued movement where the correct movement was
observed. It captures the ability of the participant to initiate the specific
movement in response to the cue and his ability to sustain the movement
for the duration of the two second cue. Specificity is the percentage of video
frames during non-target cues that were correctly identified. Accuracy is the
percentage of video frames where the observed hand position matched the
cue vector for that specific movement. It can also be calculated as a weighted
average of sensitivity and specificity, with the weights corresponding
to the relative frequency of the two classes. As such, the sensitivity and
specificity give additional performance information which can be especially
useful when the relative frequency of the classes is skewed towards one
class. Overall accuracy measures the performance of all six decoders
simultaneously by calculating the percentage of video frames where the
observed hand position matches the position the participant was asked
to achieve. Thus, errors on any of the six movements are incorporated in
the overall accuracy, so it is expected to be lower than the accuracy of any
individual movement. Error bars, ±2 s.d. (Video taken by D. Friedenberg.)
Extended Data Figure 4 | MWP during a combination movement
task. MWP as calculated by the NBS algorithm during a task combining
imagined movement, stimulation-induced movement, and non-paralysed
muscle movement is shown. The participant’s reaction time and the 1 s
boxcar filter used to smooth the neural data create the delay observed
after the cues are presented. The participant’s hand was placed on top of
a spoon set on a table and he was cued by the graphical hand to imagine
gripping a spoon (solid black line), provided with stimulation to evoke the
actual gripping of the spoon (red line), and cued by an audible beep to
transfer the spoon using the residual movement in his shoulder and elbow
(black dashed line). The spoon transfer distance was approximately 45 cm
to the left/right, alternating between each trial. Differences in MWP
can be seen for each of the three portions of each trial. The increase in
MWP between imagined movement and stimulation-induced movement 
is caused by residual stimulation artefact. During the transfer cue, the 
utilization of residual (shoulder) movement was associated with a different 
and consistent MWP pattern, requiring the development of a more robust 
decoding strategy which combined data from imagined, stimulation- 
induced, and transfer movements for decoder training (see Methods). 

Extended Data Figure 5 | Neural decoder outputs for each movement in
the individual movement task. The blue line represents the cue that the
participant was trying to match. For a particular movement, when the blue
line was at one, the user was prompted to imagine that movement. The
black line is the decoder output for that move. The decoded movement is
rest if none of the decoder outputs are above zero (dotted line); otherwise,
it is determined by the maximum of all the decoder outputs. Green
stars are placed above the 29 out of 30 cues in which the neural decoder
correctly matched the cue. In this plot the cues have been shifted by 0.8 s to
account for reaction and system lag time. Decoder output below −0.5 has
been set to −0.5 for visual clarity.
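
The decision rule described in this caption (rest unless at least one decoder output is above zero, otherwise the movement with the largest output) can be sketched as follows. This is an illustration under stated assumptions rather than the study's implementation: the movement names, the dictionary layout and the function names are hypothetical; only the zero threshold and the −0.5 display floor come from the caption.

```python
def decode_movement(outputs, rest_label="rest"):
    """Pick the decoded state from one time step of decoder outputs.

    outputs: dict mapping a movement name to its decoder output.
    Returns rest_label when no output is above zero; otherwise the
    name of the movement with the maximum output.
    """
    if not outputs:
        raise ValueError("at least one decoder output is required")
    best = max(outputs, key=outputs.get)
    return best if outputs[best] > 0.0 else rest_label


def clip_for_display(value, floor=-0.5):
    """Raise values below the floor to the floor, as done for plotting."""
    return max(value, floor)


if __name__ == "__main__":
    # Hypothetical outputs for three of the movements at one time step.
    step = {"movement_a": -0.7, "movement_b": 0.2, "movement_c": 0.9}
    print(decode_movement(step))
    print([clip_for_display(v) for v in step.values()])
```

Note that the clipping affects only what is drawn; the decision is made on the unclipped outputs.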

Extended Data Figure 6 | Evolution of MWP over time. At the beginning
of each test session, data for all channels were collected over 60 s where
the participant was instructed to close his eyes and rest. No stimulation
was provided during this period. MWP features with no mean subtraction
were calculated to approximate the power in the multiunit wavelet bands
corresponding to scales 3-6. After an initial decline the MWP stabilized
at 150 days after implantation. Dashed line represents a polynomial fit of
order 4.

Extended Data Figure 7 | Stimulation electrode pattern. The
stimulator was calibrated to evoke movements in the hand and wrist.
a, Representative map of the anode (red) and cathode (black) electrodes
in the lower (L) and upper (U) stimulation cuffs used to evoke the ‘hand
open’ movement during the individual movement task. b, Stimulation
cuffs on the participant’s arm, with the stimulation pattern highlighted.
See Supplementary Table 1 for a complete list of electrode patterns used for
the individual hand movement task. (Photographs taken by N. Annetta.)

Self-organization of the in vitro attached human embryo

Alessia Deglincerti1*, Gist F. Croft1*, Lauren N. Pietila1, Magdalena Zernicka-Goetz2, Eric D. Siggia3 & Ali H. Brivanlou1


Implantation of the blastocyst is a developmental milestone in 
mammalian embryonic development. At this time, a coordinated 
program of lineage diversification, cell-fate specification, and 
morphogenetic movements establishes the generation of extra- 
embryonic tissues and the embryo proper, and determines the 
conditions for successful pregnancy and gastrulation. Despite its 
basic and clinical importance, this process remains mysterious 
in humans. Here we report the use of a novel in vitro system (refs 1, 2) to
study the post-implantation development of the human embryo. 
We unveil the self-organizing abilities and autonomy of in vitro 
attached human embryos. We find human-specific molecular 
signatures of early cell lineage, timing, and architecture. Embryos 
display key landmarks of normal development, including epiblast 
expansion, lineage segregation, bi-laminar disc formation, amniotic 
and yolk sac cavitation, and trophoblast diversification. Our 
findings highlight the species-specificity of these developmental 
events and provide a new understanding of early human embryonic 
development beyond the blastocyst stage. In addition, our study 
establishes a new model system relevant to early human pregnancy 
loss. Finally, our work will also assist in the rational design of
differentiation protocols of human embryonic stem cells to specific 
cell types for disease modelling and cell replacement therapy. 
Recently established in vitro implantation platforms in the mouse 
have expanded our knowledge of post-implantation development 
and recapitulated in vivo early developmental landmarks (refs 1, 2). However,
extrapolation from mouse to human is limited since mammalian 
embryos display species-specific differences in post-implantation 
morphology and many molecular markers have not been validated*>. 
Therefore, to gain insights into the self-organizing abilities of human 
embryos, we sought to establish an in vitro attachment platform. To 
determine the dynamics of cell fate specification, we monitored the 
morphology and the expression of cell-type-specific markers in pre- 
and post-attachment embryos: the inner cell mass (ICM) and epiblast 
(Epi) markers OCT4 and NANOG, the ICM and primitive endoderm 
(PE) marker GATA6, and the trophectoderm (TE) markers CDX2 
and GATA3 (ref. 6). Additionally, to distinguish TE subtypes, we used 
cytokeratin 7 (CK7) and β-human chorionic gonadotropin (HCGB)’.
Embryos were fixed and stained every other day starting at day
post-fertilization (d.p.f.) 6, and counterstained with 4′,6-diamidino-2-phenylindole
(DAPI) to detect and count nuclei and phalloidin to
delineate cell boundaries (Supplementary Video 1).

Figure 1 | D.p.f. 6 human blastocyst embryos display human-specific
transcriptional profiles. a, DIC image of a d.p.f. 6 human blastocyst;
scale bar, 100 μm. b, Cartoon of a d.p.f. 6 embryo with salt-and-pepper
distribution of OCT4 (green) and OCT4/GATA6 (red) ICM cells, and GATA3
(blue); mural (arrow) and polar (arrowhead) TE cells. c-q, Immunostaining
of d.p.f. 6 blastocysts. c, Three-dimensional rendering of the front
half of a d.p.f. 6 blastocyst (left) and ICM-zoom (right). DAPI (white)
and phalloidin (magenta, actin, virtual channel). d-j, Three-dimensional
rendering of a d.p.f. 6 blastocyst (top) and ICM-zoom (bottom, box in j)
stained for OCT4 (green), GATA6 (red, virtual channel), GATA3 (blue)
(n = 3-8); arrows indicate high marker levels in ICM, arrowhead low
levels in TE. k-l, Whole embryo (top) and ICM-zoom (bottom, box in l)
of the d.p.f. 6 blastocyst from c; OCT4 (green), GATA6 (red, virtual
channel), CDX2 (cyan) (n = 5). m-q, ICM-zoom of d.p.f. 6; OCT4 (green),
NANOG (magenta), GATA6 and CD24 (red, n = 3). Scale bar, 100 μm for
whole embryos, 20 μm for ICM-zooms.

Figure 2 | Attached d.p.f. 8 human embryos begin transcriptional and
morphological self-organization. a, b, DIC images of (a) attaching
d.p.f. 7.5 embryos with the TE (top left) collapsing onto the embryo and
(b) attached and compacted d.p.f. 8. Scale bar, 50 μm. c-e, Cartoons of
blastocyst (c) floating, (d) orienting to attach on the side of the polar
TE, and (e) attached; Epi (green), PE (red), TE (blue). f-p, Optical
sections of d.p.f. 8 embryos stained with the indicated markers (n = 4).
Scale bar, 20 μm. q, Three-dimensional rendering of the embryo in f-p
with OCT4 (green), GATA6 (red), and GATA3 (blue), flanked by xz (top)
and yz (side) views at coordinates indicated by yellow lines. Scale
bar, 50 μm. r, s, Optical sections of d.p.f. 8 embryo stained with the
indicated markers. Scale bar, 20 μm. t, u, Carnegie stage 5a section
(t) and cartoon (u); Epi (green), PE (red).

Figure 3 | D.p.f. 10 embryo cell lineages diversify and self-organize
amniotic and yolk sac cavities. a, b, DIC image of a d.p.f. 10 embryo
(scale bar, 50 μm) (a) and cartoon of a section through the embryo
(b); Epi (green), amniotic cavity (arrow), PE (red), yolk sac cavity
(arrowhead), ysTE (orange), TE (blue). c, Three-dimensional rendering of
a d.p.f. 10 embryo stained with OCT4 (green), GATA6 (red), and GATA3
(blue) (n = 4), flanked by xz (top) and yz (side) views at coordinates
indicated by the yellow lines, with amniotic (arrow) and yolk sac cavities
(arrowhead). Scale bar, 50 μm. d, Optical section of the central region from
c; OCT4 (green), GATA6-phalloidin (red), and CDX2 (cyan) (n = 4). Scale
bar, 20 μm. e-h, Optical section of the dashed box in d, but on a z-plane
6 μm higher, with the amniotic cavity at the centre (arrow) lined by PE
(arrowhead). Scale bar, 20 μm. i-p, Optical section of the box in d, but on a
z-plane 4 μm below with the yolk sac cavity at the centre lined by PE (white
arrowhead) and ysTE (yellow arrowhead). Scale bar, 20 μm. q-u, Optical
section of a d.p.f. 10 embryo stained with the indicated markers (n = 4).
Scale bar, 100 μm. v, w, Optical section of the Epi/PE area of a d.p.f. 10
embryo stained with OCT4 (green) and GATA6-CD24 (red) (n = 4).
Scale bar, 20 μm. x, y, Carnegie stage 5b section (x) and cartoon (y) with
Epi (green), amniotic cavity, PE (red), yolk sac cavity, and ysTE (orange).
ph, phalloidin.

Figure 4 | D.p.f. 12 embryos exhibit characteristics of CS5C in vivo
including TE cellular phenotypes. a, b, DIC image of a d.p.f. 12 embryo
(a) (scale bar, 50 μm) and cartoon of a side view (b) with Epi (green),
amniotic and yolk sac cavities, PE (red), ysTE (orange), and TE (blue).
c, Three-dimensional rendering of a d.p.f. 12 embryo stained for OCT4
(green), GATA6 (red), and GATA3 (blue) (n = 2); flanked by xz (top) and
yz (side) views at indicated coordinates. d-m, Epi zoom of c stained
with the markers indicated; the arrow in e points at the amniotic cavity.
Scale bar, 20 μm. n-q, Optical section of the embryo in c re-stained with
the indicated markers (n = 2); scale bar, 100 μm. r-v, Appearance of
ST phenotypes; multinucleated ST cells, DAPI (white), GATA3 (blue),
phalloidin (red), HCGB (yellow), and nascent lacuna (arrow). Scale bar,
20 μm. w-x, Carnegie stage 5c section (w), and cartoon (x) of Epi (green),
amniotic and yolk sac cavities, PE (red), ysTE (orange), CT, ST, and
lacunae in TE (blue).

The pre-implantation human blastocyst (d.p.f. 6-7) is a hollow
ball of cells composed of tightly packed, small nuclei in the ICM
and larger cells in the TE (n = 8; Fig. 1a-c, Extended Data Figs 1a-c
and 2a, Supplementary Table and Supplementary Videos 2 and 3).
All ICM cells showed high intensity OCT4 staining (OCT4^HI), with
a subset having high GATA6 (GATA6^HI) in a salt-and-pepper manner
(Fig. 1d-f and Extended Data Fig. 2b-d). NANOG was observed
exclusively in OCT4+ ICM cells, which did not stain for GATA6
(Fig. 1m-q). No GATA3 or CDX2 were detected in the ICM. All
TE cell nuclei stained for GATA3, as well as low intensity OCT4
(OCT4^LO), and GATA6 (GATA6^LO; Fig. 1g-j) but not NANOG.
Variable levels of CDX2 were also detected in TE (CDX2'; Fig. 1k-l
and Extended Data Fig. 2e, f); however, in most cases, surprisingly, the
signal was localized to the cytoplasm instead of the nucleus (Fig. 1k-l).
No HCGB and weak, non-filamentous CK7 were detected (Extended
Data Fig. 2g; n = 6). The ubiquitous OCT4 pattern we report is in
agreement with previous staining of human blastocysts at d.p.f. 5-7
(refs 6, 8, 9), and stands in contrast to the mouse where it is ICM
specific. GATA6 expression in the ICM is different from the mouse and
previous reports in humans®*”. In the mouse, the salt-and-pepper
ICM distribution becomes restricted to PE and physically sorted from
the Epi population by late blastocyst (ref. 10). Our data suggest that physical
sorting between Epi and PE has not yet occurred in human d.p.f. 6,
even if the transcriptional profile is consistent with specification of
presumptive Epi and PE fate. While both CDX2 and GATA3 are TE
specific, we find that GATA3 is a better marker of human TE because
of its high signal and consistent nuclear localization. Our results also
confirm the molecular profiles from single-cell RNA sequencing studies
of laser-dissected blastocyst (d.p.f. 5-6; ref. 11), and expand them by
providing new spatial resolution. Thus, at d.p.f. 6, we define two cell
populations in the ICM: OCT4^HI/NANOG+/GATA6− (presumptive
Epi precursors), and OCT4^HI/GATA6^HI/NANOG− (presumptive
PE precursors); we also define one population of TE cells:
OCT4^LO/GATA6^LO/GATA3+/CDX2°*.

We next allowed blastocysts to attach using a recently established
mouse in vitro implantation platform (ref. 1). We found that most human
blastocysts (70%, n = 91) attached between d.p.f. 7 and 8 (Fig. 2a, b).
Interestingly, despite their random original orientation, by d.p.f.
7 the blastocysts always attached on the side of the polar TE, the
portion of TE that is the closest to the ICM (Fig. 2c-e). At d.p.f. 8, all
embryos adopted a flattened structure (Fig. 2b, e, f, Extended Data
Fig. 3a, Supplementary Table and Supplementary Video 4). While
the total cell number remained constant, the relative number of cells 
contributing to different territories changed (Extended Data Fig. 1a-c
and Supplementary Table). Two discrete territories emerged from 
the ICM, and formed the Epi and PE lineages, respectively characterized
by OCT4^HI/GATA6− and OCT4−/GATA6^HI (Fig. 2g-i).
Epi cells formed a tight core at the centre of the attached embryos 
(Fig. 2g). Phalloidin staining was extremely faint in all embryos in 
two separate experiments, suggesting qualitatively different actin 
status in the immediate post-attachment period. In the TE, OCT4 and 
GATA6 staining completely disappeared. CDX2 remained at very low 
and variable levels and was restricted to weak nuclear staining of a 
subset of TE cells at the periphery (Fig. 2j-l), while GATA3 demarcated
all OCT4− and GATA6− TE nuclei surrounding Epi and PE
(Fig. 2m-q). At this stage, all GATA3+ cells also showed strong
filamentous CK7 staining (Fig. 2r-s), a marker of general TE lineage 
and cytotrophoblast (CT), suggesting molecular progression of TE 
differentiation compared with d.p.f. 6; however, no HCGB, a marker 
of syncytiotrophoblast (ST), was detected (Extended Data Fig. 3b)’. 
Taken together, this evidence suggests that in humans, Epi/PE cell 
sorting occurs at a later stage than in the mouse, around the time of 
implantation. 

At d.p.f. 10, the attached embryos increased in size while maintaining
a flattened morphology (Fig. 3a-c, Extended Data Fig. 1a-c and
3b, Supplementary Table and Supplementary Video 5). The OCT4^HI
Epi at the centre of the embryo was more compacted, and formed
a cavity delineated by phalloidin: the putative amniotic cavity (Fig. 3d-h, 
Extended Data Figs 4a-r and 5 and Supplementary Videos 5-7). To ask 
if the Epi population had progressed in its differentiation, we stained 
for CD24, which has been reported to increase in primed (Epi-derived) 
versus ‘naive’ mouse and human embryonic stem cells (ESCs)!*. CD24 
was selectively observed on the surface of human Epi cells starting only 
at d.p.f. 10 (n = 4 of 4; Fig. 3v-w and Extended Data Fig. 4s-x), and
was not detected at d.p.f. 6 (Fig. 1o; n = 3 of 3) or d.p.f. 8 (Extended
Data Fig. 3c; n = 4 of 4), suggesting a developmental transition. All Epi
nuclei were also NANOG+ (Extended Data Fig. 4s-x). It is therefore
the d.p.f. 10 Epi population (OCT4+/NANOG+/CD24+), and not the
d.p.f. 6 or d.p.f. 8 Epi, that most closely matches the profile of human
ESCs”, which supports a recent hypothesis on the developmental
origin of human ESCs’. GATA6^HI PE cells were distributed in a layer
juxtaposed to one side of the Epi, delineating the bi-laminar germ 
disc (Fig. 3c-p and Extended Data Fig. 4a-r). On the other side of the 
PE cells, a second, larger cavity self-organized: the putative yolk sac 
cavity (Fig. 3i-p, Extended Data Figs 4l-r and 5 and Supplementary 
Videos 5 and 6). A novel population of CDX2+/GATA6^LO/OCT4^LO
cells, never described before in other mammals, lined the yolk sac
cavity in all embryos examined (n = 4). While it is possible that
these cells represent parietal endoderm, we believe that they are a
novel population because of their unique molecular signature.
We therefore named these cells ‘yolk sac trophectoderm’ or ysTE
(Fig. 3p). GATA3+ cells continued to demarcate the entire TE (Extended
Data Fig. 4a-k), which also co-stained for CK7 (Fig. 3q-u). However,
a subset of GATA3+/CK7+ cells now stained for HCGB (Fig. 3t),
suggesting the emergence of the HCGB+ ST lineage’. Collectively, the
architecture of the embryo at this stage closely resembles Carnegie
stage 5b-c (ref. 13), with amniotic cavity, yolk sac cavity, bi-laminar disc in
between, and TE differentiation (Fig. 3x-y). This demonstrates that 
the embryo alone can direct both lineage specification and diversifi- 
cation, as well as tissue morphogenesis and architectural organization, 
without maternal input. 

At d.p.f. 12, the amniotic and yolk sac cavities, albeit present, 
appeared collapsed (Fig. 4a-l, Extended Data Figs 6 and 7a-j, 
Supplementary Table and Supplementary Video 8). Epi and PE 
expression profiles were unchanged from d.p.f. 10, while TE showed 
further differentiation and organization. HCGB staining increased 
in intensity and clusters expanded to generate multinucleated cells 
with lacunae, the in vivo site of maternal blood vessel invasion, 
therefore adding functional cytological characteristics of ST lineage 
progression (Fig. 4n-q and Extended Data Fig. 7k-o). TE subtypes
self-organized in concentric rings, with a ring of CT surrounding
the Epi, multiple foci of ST towards the edge of the embryo, and a
layer of CT adjacent to the substrate (Fig. 4r-v). These features
closely mimic and provide the first molecular signature of histo- 
logical samples!* (Carnegie stage 5b-c; Fig. 4w-x). We concluded 
our experiments at d.p.f. 14, in accordance with internationally 
recognized bioethical guidelines'*'*. Outgrowths at this late stage lost 
interpretable relation to in vivo correlates’? (n = 8), suggesting the lim- 
itations of our two-dimensional culture environment. 

In this study we show that by simply providing an attachment sub- 
strate, human blastocysts self-organize to recapitulate many key fea- 
tures of in vivo development, surprisingly independently of maternal 
input at least up to d.p.f. 12. Additionally, the differences between 
human and mouse embryos, including unique architecture, cell types, 
and tissue organization, emphasize the necessity of working with 
human embryos to understand human development. By defining the 
molecular composition of cell lineages in the early embryo, our data 
provide a new ex vivo reference to reinterpret controversial discrep- 
ancies between mouse and human stem cells and their derivatives. 
This platform also constitutes a new model for a variety of poorly 
understood placental and embryonic disorders and early pregnancy 
loss. These and future studies on the origin and regulation of early 
embryonic cell types will lead to more developmentally based and 
rational approaches to reprogramming, disease modelling, and cell 
replacement therapies. 
METHODS


Ethics statement. This work was approved by The Rockefeller University 
Institutional Review Board and was in accord with University policy for research 
involving human embryos. The informed consent process for embryo donation 
complied with the NAS Guidelines for Human Embryonic Stem Cell Research 
and the Tri-Institutional Stem Cell Research Operating Procedures for ESCRO 
Reviewed Research. The Tri-Institutional Stem Cell Initiative Embryonic Stem 
Cell Research Oversight (Tri-SCI ESCRO) Committee determined that the use 
of human embryos for basic developmental studies did not require review and 
approval by the Tri-SCI ESCRO Committee because it did not involve derivation
of, or research with, human pluripotent stem cells. 

Embryo culturing protocol. Cryopreserved d.p.f. 5 embryos were thawed using 
a THAW-KIT1 (Vitrolife) according to the manufacturer’s instructions. Briefly, 
embryos were thawed by placing the cryovials in a 37°C water bath for about
30-60, collected, transferred, and incubated in ETS1 for 5 min at room
temperature (22-23 °C), transferred and incubated in ETS2 for 5 min at room temperature,
transferred and incubated in ETS3 for 7.5 min at room temperature, transferred
and incubated in cryo-PBS for 5 min at room temperature and 5 min at 37°C (not
in the incubator). Embryos were morphologically scored upon thaw and 24 h after
culture. To remove the zona, embryos were briefly exposed to acidic Tyrode’s
solution (Sigma). Embryos were seeded on ibiTreat microscopy μ-plates (Ibidi),
filled with pre-equilibrated IVC1 medium (Cell Guidance Systems). Half of the 
medium was replaced after 24 and 48h in culture. Zona-free embryos typically 
attached 48-60 h after seeding. At this point, the medium was exchanged to IVC2 
(Cell Guidance Systems), and embryos were cultured for the indicated times, with 
half of the medium being replaced with fresh medium every 24h. The embryo 
culturing and imaging protocol has been deposited on Nature Protocol Exchange, 
doi 10.1038/protex.2016.022. 

Immunofluorescence. Embryos were fixed in 4% PFA in PBS for 30 min at 4°C, 
washed three times in PBS, quenched in wash buffer (PBS + 0.1% Triton X-100) 
plus 100 mM glycine, blocked (wash buffer + 0.1% sodium azide + 10% normal 
donkey serum), stained for 3-12h with primary antibodies in blocking buffer, 
washed, stained with labelled secondary antibodies (Alexa Fluor 488, 555, and 647), 
DAPI, and phalloidin 647, and imaged (see below). To maximize the molecular 
information from each embryo, we followed several sequential staining strategies. 
First, in preliminary stains we found that GATA6 was exclusively nuclear and 
generated almost no background staining; therefore, to maximize information 
content from each embryo, we stained all embryos with phalloidin-647 (the same 
fluorescence channel with GATA6) since these could be segregated during image 
analysis using a three-dimensional segmentation mask on the DAPI to generate a 
phalloidin-only or a GATA6-only virtual channel. Floating blastocysts were stained 
with either DAPI-CDX2-OCT4-GATA6 and phalloidin in the same channel, or 
DAPI-GATA3-OCT4-GATA6 and phalloidin in the same channel since CDX2 
and GATA3 antibodies were of the same species. After analysis and quantification 
of all attached embryos (d.p.f. 8-14), CDX2 cell numbers were found to be very 
small and weakly stained (except for the ysTE cells at d.p.f. 10-14); therefore all 
attached embryos were then re-stained with GATA3 (same species, then same 
colour (Alexa Fluor 555) as CDX2) and reimaged. The staining for GATA3 was 
substantially brighter than the previous mouse CDX2-555 signal, except for the 
ysTE cells at d.p.f. 10-14. Therefore an extra virtual channel could be superimposed
on the same embryos to maximize spatial information, indicating bona fide GATA3
expression in previously CDX2− or CDX2^LO TE cells. Similarly, since GATA6
nuclear staining was absent at d.p.f. 14, we re-stained two embryos each with
rabbit anti-OTX2 and rabbit anti-SOX2. To further enhance molecular resolution,
we then probed TE lineage diversification by staining previously stained attached
embryos or fresh day 7 blastocysts. For previously stained attached embryos, we
observed that phalloidin completely detached by 2 weeks after staining and was
no longer detectable, therefore leaving the 647 channel open except for GATA6+
nuclei. We therefore blocked embryos with proteolytic Fc fragments (mouse and
rabbit), then with unlabelled donkey anti-mouse and -rabbit IgG Fab1, then stained
with 647- and 594-labelled donkey anti-mouse and -rabbit IgG Fab1 (all Jackson),
and imaged the embryos to measure background and residual crosstalk. We next 
stained with primary antibodies to trophoblast markers CK7 and HCGB followed 
by 647- and 594-labelled donkey anti-mouse and -rabbit IgG Fab1 and reimaged. 
Primary antibodies used were the following: goat-OCT4, Santa Cruz 8628, 1:500; 
rabbit-GATA6, Cell Signaling 23345, 1:1,000; mouse-NANOG, BD 560482, 
1:500; mouse-GATA3, Pierce MA1-028, 1:100; mouse-CDX2, BioGenex CDX2-88, 
1:10; rabbit-CK7 Abcam 181598, 1:400; mouse-HCGB, Abcam ab9582, 1:100; 
mouse-CD24-PE conjugate, Abcam 7729, 1:5; rabbit-SOX2, Cell Signaling 35795,
1:200; rabbit-OTX2, Abcam 114138, 1:200. Labelled secondary antibodies, DAPI, 
and phalloidin-Alexa Fluor 647 (Life) were incubated for 3-12h and washed before 
imaging the samples. 
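
The virtual-channel idea described above, where nuclear GATA6 and cytoskeletal phalloidin share the 647 channel and are separated with a DAPI-derived nuclear mask, can be sketched as follows. This is a simplified illustration, not the Imaris workflow used in the study: the function name, the flattened voxel lists, the intensity values and the fixed DAPI threshold standing in for three-dimensional segmentation are all assumptions.

```python
def split_shared_channel(shared, dapi, dapi_threshold):
    """Split one shared-fluorophore channel into two virtual channels.

    shared: voxel intensities of the combined GATA6 + phalloidin channel.
    dapi:   voxel intensities of the DAPI channel (same voxel order).
    A voxel counts as nuclear when its DAPI value exceeds dapi_threshold
    (a stand-in for a real 3D nuclear segmentation).
    Returns (gata6_only, phalloidin_only).
    """
    if len(shared) != len(dapi):
        raise ValueError("channels must contain the same number of voxels")
    nuclear = [d > dapi_threshold for d in dapi]
    # Inside the nuclear mask the signal is attributed to GATA6 ...
    gata6_only = [s if inside else 0 for s, inside in zip(shared, nuclear)]
    # ... and outside the mask it is attributed to phalloidin.
    phalloidin_only = [0 if inside else s for s, inside in zip(shared, nuclear)]
    return gata6_only, phalloidin_only


if __name__ == "__main__":
    # Hypothetical voxel values for illustration only.
    shared = [5, 9, 3, 7]
    dapi = [0, 200, 10, 180]
    gata6, phalloidin = split_shared_channel(shared, dapi, dapi_threshold=100)
    print(gata6, phalloidin)
```

The split only works because, as noted above, GATA6 staining was exclusively nuclear with almost no background; any nuclear phalloidin signal would be misassigned to GATA6 by this scheme.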

Image acquisition, analysis, and rendering. Z-stack images were acquired on 
a Leica SP8 inverted confocal microscope at 12 bits in 1024 pixels × 1024 pixels
using an HCX PL APO CS ×20/0.75 numerical aperture air-immersion or an HC
PL APO CS2 ×40/1.10 numerical aperture water-immersion objective, at 1 Airy
unit pinhole diameter and 0.985-1.9 μm spacings with manual laser-power Z
intensity-compensation, and avoiding detector saturation with laser excitation/HyD
detector emission settings, in nanometres, as follows: 405 diode laser 405/415-486, 
WLL 492/500-550, WLL 552/560-621, and WLL 649/657-710 for DAPI, Alexa 
Fluor 488, 555, 594, and 647 respectively. When 594 was imaged, the detector 
setting for Alexa Fluor 555 was adjusted to 560-580, and a fifth track was added, 
594 WLL 594/602-631. Images were then deconvolved with a three-dimensional 
blind algorithm (ten iterations) using AutoDeblur X software (Autoquant). Images 
were rendered and visualized in Imaris, where total and marker-positive nuclei 
were quantified by a spot finding algorithm with manual adjustment for fluo- 
rescence intensity threshold and quality. Where antibodies or tissue generated 
non-specific background staining, a nuclear mask was first generated from the 
DAPI signal, and spots were counted on masked channels as above. Final images 
were processed and assembled using ImageJ. For the three-dimensional renderings
and movies, individual channels were gamma-adjusted to enhance visualization.

Statistical analysis. No statistical methods were used to predetermine sample
size. The experiments were not randomized. The investigators were not blinded to 
allocation during experiments and outcome assessment. 

Errors and error bars represent the s.e.m. from a minimum of four independent 
embryos unless otherwise indicated. The figures display representative results. 
Unless otherwise specified, the results were the same across all the embryos 
analysed. 
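
For readers who want to reproduce the summary statistic named above, the mean and s.e.m. of per-embryo counts can be computed as in the sketch below. The function name and the example counts are hypothetical and are not taken from the Supplementary Table; the s.e.m. is computed in the usual way as the sample standard deviation divided by the square root of the number of embryos.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, s.e.m.) for a list of per-embryo measurements."""
    n = len(values)
    if n < 2:
        raise ValueError("s.e.m. needs at least two independent embryos")
    mean = statistics.mean(values)
    # statistics.stdev is the sample standard deviation (n - 1 denominator).
    sem = statistics.stdev(values) / math.sqrt(n)
    return mean, sem


if __name__ == "__main__":
    # Hypothetical cell counts from four embryos, for illustration only.
    counts = [2, 4, 4, 6]
    mean, sem = mean_sem(counts)
    print(f"{mean:.2f} ± {sem:.2f}")
```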


© 2016 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 1 | Quantification of cells positive for
transcriptional markers over d.p.f. 6-12 of in vitro culture. a, Time
course of total cell numbers via DAPI quantification. Confocal data
sets for each embryo were scored for total cell numbers in Imaris. The
mean ± s.e.m. is shown for d.p.f. 6, 8, 10, 12 (n = 8, 4, 4, 2 embryos per
time point respectively over n = 2, 2, 2, 1 independent experiments).
A total of 91 embryos were thawed and 64 (70%) showed developmental
progression and attached; 26 of 64 (40%) embryos showed normal size
and morphology and were analysed for markers of cell fate. The remaining
embryos all showed abnormal development, represented by few (0-5) and
scattered OCT4 cells, and were excluded from analysis as obvious failures
of development. Subsequent to staining and analysis of this set of embryos
with these core markers, six d.p.f. 7 and five d.p.f. 10 embryos showing
normal OCT4 development were also stained and analysed for additional
markers. b, Total cells expressing cell type markers, mean ± s.e.m. for
OCT4, GATA6, CDX2, and GATA3: d.p.f. 6 (n = 8, 7, 5, 3 respectively);
d.p.f. 8 and 10 (n = 4 embryos per marker); d.p.f. 12 (n = 2 embryos per
marker). The line subdividing the OCT4 population identifies the two
subpopulations of OCT4+ cells: OCT4^HI (bottom of the bar), passing
a threshold for high-intensity staining; and OCT4^LO (top of the bar),
passing a low threshold (but not the higher one for OCT4^HI) for staining.
At d.p.f. 6, OCT4^HI cells were confined to the ICM while the OCT4^LO
cells represented the TE; between d.p.f. 8 and 12, OCT4^LO cells were also
co-expressing CDX2 (ysTE cells), while OCT4^HI cells were confined to
the Epi. GATA6 was scored for high or low (HI/LO) expression at d.p.f.
6 only, where GATA6^HI cells (bottom of the bar) localized to the ICM
and GATA6^LO cells localized to the TE. See also Supplementary Table.
c, Number of cells expressing lineage-specific markers expressed as
a percentage of total nuclei; each bar shows mean ± s.e.m. for OCT4,
GATA3, GATA6, and CDX2. The line subdividing the OCT4 bars at d.p.f.
6-12 and the GATA6 bar at d.p.f. 6 represents the high or low (HI/LO)
expressing cells as explained above.
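
The two-threshold HI/LO scoring and the percentage-of-total-nuclei measure described in this caption can be sketched as follows. This is an assumption-laden illustration: the function names, the "NEG" label and the threshold values are hypothetical, and the actual scoring was done in Imaris with manually adjusted thresholds. Only the counts 64 and 91 are taken from the caption.

```python
def classify_intensity(intensity, low_threshold, high_threshold):
    """Score one nucleus as "HI", "LO" or "NEG" using two thresholds.

    "HI": passes the high threshold.
    "LO": passes the low threshold but not the high one.
    "NEG": passes neither (label chosen here for illustration).
    """
    if low_threshold > high_threshold:
        raise ValueError("low_threshold must not exceed high_threshold")
    if intensity >= high_threshold:
        return "HI"
    if intensity >= low_threshold:
        return "LO"
    return "NEG"


def percent_of_total(marker_positive, total_nuclei):
    """Marker-positive nuclei as a percentage of all nuclei."""
    if total_nuclei <= 0:
        raise ValueError("total_nuclei must be positive")
    return 100.0 * marker_positive / total_nuclei


if __name__ == "__main__":
    # Hypothetical intensities and thresholds (arbitrary units).
    for value in (10, 120, 400):
        print(value, classify_intensity(value, low_threshold=50, high_threshold=200))
    # Attachment rate reported in the caption: 64 of 91 embryos.
    print(round(percent_of_total(64, 91)))
```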




Extended Data Figure 2 | Human-specific transcriptional profiles
of ICM and TE in d.p.f. 6 human blastocysts. a, Additional three-
dimensional rendering of the front half of a d.p.f. 6 blastocyst (left), and
ICM-zoom (right). DAPI (white) identifies all nuclei and phalloidin
(magenta, actin, virtual channel) shows intercellular boundaries between
TE cells. A three-dimensional segmentation mask on the DAPI channel
was subtracted from the raw GATA6-phalloidin channel to yield a
phalloidin-only virtual channel for these panels. This embryo is the same
as displayed in Fig. 1d-j. Scale bar, 100 μm for whole embryos, 20 μm for
ICM-zooms. b-f, Top, whole embryo, and bottom, ICM-zooms, stained
for OCT4 (green), GATA6 (red, virtual channel), CDX2 (cyan), and
serial merges. b, c, Arrows show that ICM cells have high-intensity OCT4
with or without GATA6 staining; arrowheads show low-level expression
of OCT4 and GATA6 in TE cells; e, f, CDX2 staining is weak and
predominantly cytoplasmic. The GATA6-only channel is the result of an
inverse mask using three-dimensional segmentation on DAPI to remove
extra-nuclear phalloidin staining, resulting in the nuclear GATA6-only
virtual channel. This embryo is the same as displayed in Fig. 1k-l. Scale
bar, 100 μm for whole embryos, 20 μm for ICM-zooms. g, No HCGB and
very weak CK7 signal was observed in d.p.f. 7 embryos (n = 6). Shown
is a section through a d.p.f. 7 embryo stained with DAPI (white), OCT4
(green), HCGB (yellow), and CK7 (magenta). Scale bar, 100 μm.
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Extended Data Figure 3 | Molecular signature of d.p.f. 8-10 embryos. 
a, Embryo reconstruction of a d.p.f. 8 embryo oriented at a 30° pitch; 
individual channels are shown in greyscale and then merged as OCT4 
(green), GATA6-phalloidin (red), GATA3 (blue). Phalloidin staining of 


all d.p.f. 8 embryos was exceptionally weak compared with all other d.p.f. 


(all embryos analysed in two separate experiments); scale bar, 50 zm. 
b, No HCGB staining was detected at d.p.f. 8 (n = 4). Section through a 
d.p.f. 8 embryo showing DAPI (white), OCT4 (green), HCGB (yellow), 
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and CK7 (magenta). Scale bar, 501m. c, No CD24 staining was detected at 
d.p.f. 8 (n= 4). Section through a d.p.f. 8 embryo showing DAPI (white), 
OCT4 (green), CD24 (red). Scale bar, 501m. d, Embryo reconstruction of 
a d.p.f. 10 embryo oriented at a 30° pitch; individual channels are shown 
in greyscale and then merged as OCT4 (green), GATA6-phalloidin 

(red), GATA3 (blue). Phalloidin staining of this embryo was so intense 
that the original underlying GATA6 nuclear stain is not visible in this 
reconstruction; scale bar, 50 µm. 

Extended Data Figure 4 | Human-specific transcriptional profile of 
d.p.f. 10 embryos. D.p.f. 10 in vitro attached embryos recapitulate the 
principal landmarks of Carnegie stage 5b (in vivo d.p.f. 9). a-k, Confocal 
z-sections of a d.p.f. 10 embryo stained for DAPI (white), OCT4 (green), 
GATA6 and phalloidin (GATA6-Ph, red), CDX2 (cyan), GATA3 (blue), 
and combinatorial merges. The box in f indicates the area including 
amniotic and yolk sac cavities, which is shown in Fig. 3d; scale bar, 50 µm. 
l-r, Additional confocal z-section of the boxed region in f, stained for 
DAPI (white), OCT4 (green), GATA6-phalloidin (red), CDX2 (cyan), and 
combinatorial merges, showing amniotic cavity formed by OCT4+-only 
Epi cells (arrow in m and n), flanking GATA6+-only PE cells (white 
arrowhead in n), and CDX2+/GATA6+/OCT4lo ysTE cells lining the 
yolk sac cavity (yellow arrowhead in n, o); the CDX2+/GATA6+/OCT4lo 
merge of this series is shown in Fig. 3d. Scale bar, 20 µm. s-x, Confocal 
z-section of the Epi/PE area of a d.p.f. 10 embryo stained for DAPI (white), 
OCT4 (green), GATA6 and CD24 (red), NANOG (magenta). This is the 
same embryo as in Fig. 3v, w. Scale bar, 20 µm. 

Extended Data Figure 5 | Full z-series d.p.f. 10 amniotic and yolk sac cavities. Series of confocal z-sections of the embryo from Fig. 3c stained for 
OCT4 (green), GATA6-phalloidin (red), and CDX2 (cyan), showing the amniotic and yolk sac cavities, with ysTE cells showing expression of varying 
levels of all three markers. The scale is the same as in Fig. 3d. 

Extended Data Figure 6 | Tilted three-dimensional reconstruction of a d.p.f. 12 embryo. The embryo reconstruction is oriented at a 30° pitch, with 
individual channels shown in greyscale and then merged as OCT4 (green), GATA6-phalloidin (red), GATA3 (blue). Phalloidin staining of this embryo 
was so intense upon re-staining that the original underlying GATA6 nuclear stain is not visible in this data set. Scale bar, 50 µm. 


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


Extended Data Figure 7 | Additional images from a d.p.f. 12 embryo. 
a-e, Confocal z-section of the d.p.f. 12 embryo in Fig. 4c stained for DAPI 
(white), OCT4 (green), GATA6-phalloidin (GATA6-Ph, red), GATA3 
(blue), and CDX2 (cyan). Scale bar, 100 µm. f-o, Confocal sections of the 
same d.p.f. 12 embryo stained and imaged first (f-j) for DAPI (white), 
OCT4 (green), GATA6-phalloidin (red), and GATA3 (blue) and then 
(k-o) re-stained and re-imaged for DAPI (white), OCT4 (green), 
CK7 (magenta), and HCGB (yellow). DAPI and OCT4 were used as 
landmarks to identify the same z-plane and the same cells between the two 
staining and imaging rounds. The arrow in j and o indicates an example 
of nascent lacuna, typical of ST cells at Carnegie stage 5c in vivo; the box 
indicates an example of multinucleated cells characteristic of ST lineage 
progression; zoom-ins of the area in the box are presented in Fig. 4r-v. 
Scale bar, 20 µm. 


The evolution of cooperation within the gut microbiota 


Seth Rakoff-Nahoum, Kevin R. Foster & Laurie E. Comstock 


Cooperative phenotypes are considered central to the functioning of 
microbial communities in many contexts, including communication 
via quorum sensing, biofilm formation, antibiotic resistance, and 
pathogenesis!~°. The human intestine houses a dense and diverse 
microbial community critical to health?*°, yet we know little 
about cooperation within this important ecosystem. Here we test 
experimentally for evolved cooperation within the Bacteroidales, 
the dominant Gram-negative bacteria of the human intestine. We 
show that during growth on certain dietary polysaccharides, the 
model member Bacteroides thetaiotaomicron exhibits only limited 
cooperation. Although this organism digests these polysaccharides 
extracellularly, mutants lacking this ability are outcompeted. In 
contrast, we discovered a dedicated cross-feeding enzyme system 
in the prominent gut symbiont Bacteroides ovatus, which digests 
polysaccharide at a cost to itself but at a benefit to another species. 
Using in vitro systems and gnotobiotic mouse colonization models, 
we find that extracellular digestion of inulin increases the fitness of 
B. ovatus owing to reciprocal benefits when it feeds other gut species 
such as Bacteroides vulgatus. This is a rare example of naturally- 
evolved cooperation between microbial species. Our study reveals 
both the complexity and importance of cooperative phenotypes 
within the mammalian intestinal microbiota. 

A major challenge facing the study of host-associated microbiotas 
is to understand the ecological and evolutionary dynamics that shape 
these communities”*!""'?, A key determinant of microbial dynamics is 
the balance of cooperation and competition both within and between 
species”*!*'°. Here we test for the evolution of cooperation within the 
mammalian microbiota by focusing on the Bacteroidales, the most 
abundant order of Gram-negative bacteria of the human intestine with 
species that co-colonize the host at high densities of 10°-10'! CFU per 
gram of faeces (refs 16, 17). Members of this order break down polysaccharides 
outside of their cell using outer surface glycoside hydrolases®'’, some 
of which are secreted on outer membrane vesicles*’*. This suggests a 
significant potential for one cell to cooperatively feed other cells. 
As extracellular digestion is considered important for growth of 
Bacteroidales on polysaccharides®*, we focused on this trait as a can- 
didate for cooperative interactions within the gut microbiota and asked 
whether a bacterium that breaks down a polysaccharide extracellularly 
receives most, or all*°, of the benefits of its efforts. 

We first made isogenic mutants in genes responsible for the extracell- 
ular digestion of polysaccharides in the well-studied human gut strain, 
Bacteroides thetaiotaomicron VPI-5482 (refs 1, 8). Specifically, we deleted 
the genes encoding the outer surface glycoside hydrolases BT3698 of the 
amylopectin/starch utilization locus and BT1760 of the levan utilization 
locus (Fig. 1a), required for growth on amylopectin and levan, respectively 
(Fig. 1b, c, Extended Data Fig. 1a, b). Consistent with previous 
observations!*, neither mutant grew with the specific polysaccharide in 
monoculture (Fig. 1b, c, Extended Data Fig. 1a, b). However, co-culture 
of ΔBT3698 or ΔBT1760 with wild type in amylopectin or levan 
increased the fitness of the mutants (Fig. 1d, e, Extended Data Fig. 1c, d). 
This is consistent with cooperation via public good availability of 
amylopectin and levan breakdown products. 


[Figure 1 panels. a, Gene colour key: SusC homologue, SusD homologue, GH/SPII, GH/SPI, inner membrane monosaccharide importer, fructokinase, hybrid two-component sensor. b, c, Growth curves against Time (h) for Bt WT and Bt Δ3698 in amylopectin (b) and for Bt WT and Bt Δ1760 in levan (c). d, e, Counts of Bt WT and Bt Δ3698 (d) or Bt Δ1760 (e); S, start of culture; E, end at early stationary phase.] 


Figure 1 | Direct and cooperative benefits of polysaccharide digestion 
by surface glycoside hydrolases (GH). a, Polysaccharide utilization loci 
of B. thetaiotaomicron (Bt) for amylopectin and levan with the products or 
properties each gene encodes listed above and colour-coded. SPI or SPII, 
signal peptidase I or II cleavage site, respectively. b, c, Growth of Bt wild 
type (WT) and surface GH mutants in media with amylopectin (n = 2, 
cell culture biological replicates) (b) or levan (n = 2, cell culture biological 
replicates) (c). See Extended Data Fig. 1 for additional independent 
experiments. d, e, Growth of Bt WT and surface GH mutants in mono- 
and co-culture in media with amylopectin (n = 2, cell culture biological 
replicates) (d) or levan (n = 2, cell culture biological replicates) (e). See 
Extended Data Fig. 1 for additional independent experiments. In all panels 
error bars represent standard error; P values derived from two-tailed 
Student’s t-test. 

Figure 2 | B. ovatus does not require surface digestion for utilization 
of inulin. a, Predicted inulin utilization locus of B. ovatus (Bo). Gene 
designations shown are preceded by BACOVA_. The colour coding of 
gene products is as in Fig. 1. b, Thin layer chromatography (TLC) analysis 
of inulin defined medium incubated for the indicated times with purified 
His-tagged 04502, 04503 or 04502 and 04503, or vector control. See 
Supplementary Information Fig. 1 for uncropped scanned images. 
c, Growth of Bo WT and mutants in inulin defined medium. d, TLC 
analysis of conditioned media during the growth of Bo WT and mutants 
in 0.5% and 0.1% inulin. EL, early log; ML, mid log; LL, late log; 
St, stationary. e, Growth of Bo WT, Δ04502/3, Δ04504 (susD orthologue) 
and Δ04505 (susC orthologue) mutants in inulin defined medium. 
For c, e, each line represents n = 1 sample per condition. See 
Extended Data Fig. 2 for additional independent experiments. 
In all panels error bars represent standard error; P values derived 
from two-tailed Student's t-test. 


One of the key questions in evolutionary biology is how cooperative 
systems can be evolutionarily stable*>'4*!”. If certain cells invest in 
the production of an enzyme that helps others, what prevents these cells 
from being outcompeted by cells that consume the breakdown products 
without making the enzyme? In the B. thetaiotaomicron amylopectin 
and levan polysaccharide utilization systems, while receiving public 
goods from the wild type benefits the mutant cells (Fig. 1d, e, Extended 
Data Fig. le), they do not outcompete the wild type. Cells that make 
the enzyme receive more benefits than non-producing neighbouring 
cells. This observation suggests that a cell can utilize the majority of the 
polysaccharide that it breaks down, and that these private”* benefits 
are central to the evolutionary stability of extracellular polysaccharide 
digestions in these systems. 
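
The stability argument in this paragraph can be illustrated with a toy public-goods model. The sketch below is not the authors' analysis: the function names, the parameters `benefit`, `cost` and `private_share`, and the numerical values are all assumptions chosen for illustration. A producer keeps a fraction `private_share` of the benefit of digestion and the rest is shared equally by all cells, so producers out-earn non-producers by `private_share * benefit - cost` whatever the producer frequency.

```python
def payoffs(producer_freq, benefit, cost, private_share):
    """Return (producer, non_producer) payoffs in a toy public-goods model.

    private_share is the fraction of the digestion benefit kept by the cell
    that made the enzyme; the remainder is shared equally by all cells.
    All parameters are hypothetical, not measured quantities.
    """
    shared = (1.0 - private_share) * benefit * producer_freq
    producer = private_share * benefit + shared - cost
    non_producer = shared
    return producer, non_producer


def simulate(producer_freq, benefit, cost, private_share, generations=50):
    """Iterate discrete replicator dynamics with fitness = 1 + payoff."""
    x = producer_freq
    for _ in range(generations):
        pay_p, pay_n = payoffs(x, benefit, cost, private_share)
        w_p, w_n = 1.0 + pay_p, 1.0 + pay_n
        x = x * w_p / (x * w_p + (1.0 - x) * w_n)
    return x


# Start both scenarios at an equal mix of producers and non-producers.
mostly_private = simulate(0.5, benefit=1.0, cost=0.2, private_share=0.8)
mostly_public = simulate(0.5, benefit=1.0, cost=0.2, private_share=0.1)
print(f"producer frequency, 80% private: {mostly_private:.3f}")
print(f"producer frequency, 10% private: {mostly_public:.3f}")
```

In this sketch producers spread when most of the benefit is private and decline when most of it is public, which is the qualitative pattern the paragraph describes for the amylopectin and levan systems.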
Figure 3 | Cost of inulin digestion by surface glycoside hydrolases. 
a, Growth yield of Bo WT and mutants on inulin agarose plates. Each 
line represents n = 1 sample per condition. See Extended Data Fig. 4 
for additional independent experiments. b, Bacteria per gram of faeces 
of gnotobiotic mice seven days after monocolonization with Bo WT or 
Δ04502/3 on a polysaccharide-free diet or supplemented with inulin 
(n = 5 biological replicate mice per condition). c, Growth curves (upper 
and middle panels) and maximal A600 nm (bottom panel) of Bo WT or 
Δ04502/3 in minimal media with carbon sources as indicated. Each 
line represents n = 1 sample per condition. See Extended Data Fig. 4 for 
additional independent experiments. In the bottom panel, P values are 
displayed as paired (matched WT and Δ04502/3 within same experiment) 
followed by unpaired (all experiments) two-tailed Student's t-test (n = 7, 
biological replicates). d, Growth of Bo WT or mutants in inulin or a 
stoichiometric equivalent amount of inulin breakdown products after 
digestion with purified 04502 and 04503 enzymes (n = 2, cell culture 
biological replicates); *P < 0.05; **P < 0.01; ***P < 0.001; see Extended 
Data Fig. 5 for additional independent experiments. In all panels error bars 
represent standard error; P values derived from two-tailed Student's t-test. 


We extended our analysis to another prominent member of the 
human Bacteroidales known to extensively utilize polysaccharides, 
Bacteroides ovatus (Bo)**. During growth on inulin, a dietary fructan 
known for health-promoting effects”°, Bo extracellularly digests and lib- 
erates considerable amounts of inulin breakdown products’. The pre- 
dicted inulin utilization locus of the Bo type strain ATCC 8483 encodes 
two similar outer surface glycoside hydrolases, BACOVA_04502 and 
BACOVA_04503 (Fig. 2a), both of which are predicted to target the 
β1,2 inulin fructose polymer!. Both of these enzymes are required for 
inulin breakdown (Fig. 2b). We therefore predicted that a single mutant 
of either BACOVA_04502 or BACOVA_04503 would be unable to grow 
on inulin. Surprisingly, neither of the single deletion mutants (Δ04502 
or Δ04503) nor the double mutant (Δ04502/3) demonstrated impaired 
fitness with inulin as the sole carbohydrate source (Fig. 2c, Extended 
Data Fig. 2a), even at limiting concentrations (Extended Data Fig. 2a). 

Given the importance of extracellular polysaccharide digestion 
for growth of Bacteroides on numerous polysaccharides!®* (Fig. 1, 
Extended Data Fig. 1), we predicted that Bo synthesizes other enzymes 
that break down inulin extracellularly, allowing the Bo mutants to grow 
on this polysaccharide. However, analysis of the growth media of 
Δ04502, Δ04503 and Δ04502/3 revealed no released inulin breakdown 
products, demonstrating that BACOVA_04502 and BACOVA_04503 
are solely responsible for extracellular digestion of inulin (Fig. 2d, 
Extended Data Fig. 2b; see Extended Data Fig. 2c for complemen- 
tation). BACOVA_04504 and BACOVA_04505 encode SusD and 
SusC orthologues respectively, which are predicted to function in 
inulin binding and the import of digestion products®”®. Deletion of 
BACOVA_04504 or BACOVA_04505 resulted in significant impair- 
ment of growth on inulin (Fig. 2e, Extended Data Fig. 3a; see Extended 
Data Fig. 3b for complementation). Growth of Δ04502, Δ04503 and 
Δ04502/3 in limiting concentrations of inulin revealed depletion of 
inulin (Fig. 2d, right panel; Extended Data Fig. 2b). Together, these 
data demonstrate that surface enzymes 04502 and 04503 are not needed 
for Bo to utilize inulin, and suggest that inulin is directly imported via 
04504-04505 without prior extracellular digestion. 

Why would Bo synthesize surface/secreted enzymes that potently 
digest inulin outside of the cell if not necessary for its growth on the 

Figure 4 | Interspecies cooperation mediated by surface digestion 
of inulin is stabilized by reciprocal benefits. a, Germ-free mice 
were monocolonized with Bo WT or Δ04502/3 and maintained on a 
polysaccharide-free diet supplemented with inulin and housed for 2 weeks. 
Mice were then gavaged with the caecal contents of conventionally raised 
mice. At day 5 post gavage, caecal contents were plated for enumeration 
of Bo (n = 5 biological replicate mice per group). b, Enumeration of Bv 
in monoculture or co-culture with Bo WT or Δ04502 on defined inulin 
plates. *P = 0.001 for number of Bv in Bv/Bo WT co-culture vs Bv alone; 
†P = 0.001 for number of Bv in Bv/Bo WT co-culture vs Bv/Δ04502 
co-culture; ‡P = 0.003 for number of Bv in Bv/Δ04502 co-culture vs Bv 
alone (n = 2, cell culture biological replicates). See Extended Data Figs 7 
and 9 for additional independent experiments. c, (left) Photos of patches of 
Bv plated at varying distances around Bo WT or Bo Δ04502/3 or alone on 
inulin agarose plates. (right) Enumeration of Bv after five days of culture 
plated at the same distance to Bo WT or Bo Δ04502/3 or alone on inulin 
plates. See Extended Data Fig. 9 for additional independent experiments. 
d, Enumeration of Bo WT or Δ04502 in monoculture or co-culture with Bv 
on defined inulin plates. P values correlate to colour of line/genotype used 
for statistical analysis. P value indicates comparison of monoculture and 
co-culture for the given condition at the time-point indicated. The benefit 
Bo receives from Bv is most robust when starting with fewer Bo. Depicted are 
starting CFU of ~10° Bo. (n = 3, cell culture biological replicates at day 1, 2; 
n = 2, biological replicates at day 4). See Extended Data Fig. 7 for additional 
independent experiments and Extended Data Fig. 7b for starting CFU of 
~10’ Bo. e, Ratios of wild type and Δ04502/3 at the start and day 2 of culture 
when co-plated with or without Bv on inulin plates (n = 3, cell culture 
biological replicates). See Extended Data Fig. 9 for additional independent 
experiments. f, Ratios of WT and Δ04502/3 in the inoculum (day 0) and in 
faeces at various time points (days 4, 7, 11, 14, 18) post co-colonization of 
gnotobiotic mice (n = 3 cell culture biological replicates) with Bo WT and 
Δ04502/3. Polysaccharide-free diet was supplemented with inulin at day 7. 
Bv was introduced at day 14. A Fisher exact test comparing the frequency of 
Bo WT and Δ04502/3 pre- (day 14) and post- (day 18) colonization with Bv 
was significant with a P value of 0.0001 for each individual mouse. Bo CFU 
in faeces were maximal after the switch to inulin diet and addition of Bv 
changed the abundance of Bo WT compared to Δ04502/3 but not total CFU 
of Bo. See Extended Data Fig. 9 for additional independent experiments. In 
all panels error bars represent standard error; P values displayed are derived 
from two-tailed Student's t-test. 


polysaccharide? A key evolutionary explanation for the release of secreted 
products by microbes is that they feed clonemates in a manner that is 
beneficial at the level of the clonal group”*!"!*!°. We hypothesized 
that the importance of extracellular digestion may be realized during 
spatially structured growth on plates where not all cells are in direct 
contact with the polysaccharide. However, mutant bacteria showed 
no significant differences in growth yield compared to wild type on 
defined inulin plates (Fig. 3a; Extended Data Fig. 4a). In addition, these 
enzymes were not required for optimal growth in vivo as wild type 
and Δ04502/3 showed equal colonization levels in gnotobiotic mice 
fed a polysaccharide-free diet with inulin added as the sole dietary 
polysaccharide (Fig. 3b). Therefore, we could find no evidence that 
inulin breakdown by 04502 and 04503 benefits Bo in three-dimensional 
growth or during monocolonization of the mammalian gut. 

Bacteroidales polysaccharide utilization loci can be induced by monomers 
or oligomers of the cognate polysaccharide’”®. This raised the 
possibility that 04502 and 04503 may be important for optimal growth 
on inulin during induction. However, although addition of trace 
amounts of fructose monomers or oligosaccharides led to accelerated 
growth on inulin (Fig. 3c, Extended Data Fig. 4b), this did not require 
04502/3 (Fig. 3c, Extended Data Fig. 4b). Rather than a benefit from 
the presence of the enzymes, we observed a cost based on reduced yield 
of the wild type compared to the Δ04502/3 mutant during induced growth 
(Fig. 3c, Extended Data Fig. 4b) that was independent of a direct ener- 
getic cost of synthesis of 04502/3 (Figs 2c, e, 3c, d and Extended Data 
Figs 2a, 4b and 5a, b, c). We find that Bo grows better on undigested inulin 
than an equal concentration of inulin breakdown products (Fig. 3d, 
Extended Data Fig. 5a, b). In addition, the yield advantage to the 
mutant does not occur under limiting inulin concentrations (Extended 
Data Fig. 5d, e). Furthermore, Bo preferentially consumes longer inulin 
digestion products over shorter oligomers and fructose (Fig. 2d, 
right panel, Extended Data Fig. 2b, right panel). Together, these data 
suggest that undigested inulin is the preferred substrate of Bo, and that 
extracellular digestion by 04502/3 under certain conditions is costly 
for fitness. 

We found no evidence that extracellular digestion of inulin by Bo 
evolved for cooperation with clonemates. Therefore, we speculated 
that this trait might have evolved for cooperation with other species 
in the gut. We first sought evidence of cooperation in the setting of 
a natural gut ecosystem. Germ-free mice were fed inulin and colonized 
with either Bo wild type or Δ04502/3 followed by the introduction 
of the caecal microbiota of conventionally raised mice. Bo wild 
type and Δ04502/3 equally colonized mice before the introduction 
of the microbiota (Extended Data Fig. 6a), but Bo wild type received 
a significant fitness benefit compared to Δ04502/3 in the context of a 
complex microbiota (Fig. 4a). These data suggested that, although not 
required for Bo to utilize inulin, 04502 and 04503 provide a benefit 
to Bo only realized in a community setting. The conditions for the 
evolution of cooperation between species are much more restrictive 
than those within a clone. In particular, theory predicts that costly 
interspecies cooperation will only be stabilized if there are reciprocal 
feedback benefits, such as a plant providing nectar for an insect that 
pollinates it?*’. From this experiment, we identified two dominant 
mouse microbiota Bacteroidales strains that thrived on Bo-derived inu- 
lin breakdown products (Extended Data Fig. 6b), with delayed growth 
on inulin (Extended Data Fig. 6b). These data suggest that cross-feeding 
Bacteroidales members may provide reciprocal benefits to wild-type 
Bo in the mammalian gut. 

To test experimentally for reciprocity and benefits of inulin digestion 
between species, we used an inulin co-culture system with Bo wild 
type or Δ04502 and Bacteroides vulgatus ATCC 8482 (Bv), which is 
commonly found together with Bo at high densities in humans (refs 16, 17) 
and thrives on inulin breakdown products? but cannot use inulin!”. 
Co-culture and proximate plating with Bo wild type increased the 
fitness of Bv compared to that with Δ04502 (Fig. 4b, c and Extended Data 
Figs 7a, 9a); however, Bv is able to persist better with Δ04502 than 
when alone (Fig. 4b, Extended Data Fig. 7a), owing to a small 
(<2,000 Da) secreted molecule(s) that contributes to the survival of Bv 
(Extended Data Fig. 8a, b). This 04502/3-independent survival is not 
mediated by a universal factor made by Bacteroides during growth 
on inulin nor Bo-derived short-chain fatty acids (Extended Data 
Fig. 8a, c, d). Thus, there are multiple mechanisms by which Bo helps 
Bv (Fig. 4b, Extended Data Fig. 7a), the greatest being cross-feeding 
mediated by 04502/3. 

We next addressed the question of whether Bo receives reciprocal 
benefits from Bv. Co-culture of Bv with Bo on plates increased the 
fitness of Bo wild type and Bo Δ04502 (Fig. 4d, Extended Data Fig. 7b), 
but did not increase the fitness of B. fragilis (Extended Data Fig. 7c). 
If inulin breakdown can be costly, and Bo receives benefits from Bv 
irrespective of whether inulin is broken down and fed to Bv, natural 
selection is expected to favour the loss of the genes encoding the 
secreted inulin glycoside hydrolases”. However, our pairwise 
experiments would not reveal the possibility that Bo wild type receives 
more reciprocal benefits from Bv when in direct competition with 
the non-cross-feeding mutant”!”’. Therefore, we co- and tri-cultured 
these strains on plates and compared the yields of the two Bo strains 
(wild type and Δ04502/3) with or without Bv. Addition of Bv leads to an 
increased proportion of Bo wild type compared to the Δ04502/3 mutant 
(Fig. 4e, Extended Data Fig. 9b). We extended these studies to a gnotobiotic 
mouse colonization model. Bo wild type had no advantage in direct 
competition with Δ04502/3 on a polysaccharide-free diet or when inulin 
is added (Fig. 4f). However, introduction of Bv increased the fitness of 
the Bo wild type relative to the mutant (Fig. 4f, Extended Data Fig. 9c, d). 
Together, these data suggest that extracellular breakdown of inulin 
increases the fitness of Bo via reciprocal benefits from another species. 
These findings are consistent with the evolution of cooperation between 
species within the gut microbiota. 
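
The Fig. 4f legend reports a Fisher exact test on the frequency of Bo wild type and the 04502/3 deletion mutant before and after introduction of Bv. A minimal sketch of that kind of comparison follows, using only the Python standard library. The colony counts are hypothetical placeholders, not data from this study, and the function names are illustrative; the two-sided P value sums the probabilities of all tables that are no more likely than the observed one, which is one common convention.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact test P value for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same margins
    that are no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)

    def weight(k):
        # Unnormalised probability (an exact integer) of the table whose
        # top-left cell equals k.
        return comb(row1, k) * comb(row2, col1 - k)

    observed = weight(a)
    k_min, k_max = max(0, col1 - row2), min(row1, col1)
    extreme = sum(weight(k) for k in range(k_min, k_max + 1)
                  if weight(k) <= observed)
    return extreme / total


def ratio_fold_change(wt_before, mut_before, wt_after, mut_after):
    """Fold change in the WT:mutant ratio between two time points."""
    return (wt_after / mut_after) / (wt_before / mut_before)


# Hypothetical colony genotype counts for one mouse (not data from the paper).
wt_day14, mut_day14 = 40, 60
wt_day18, mut_day18 = 75, 25

p_value = fisher_exact_two_sided(wt_day14, mut_day14, wt_day18, mut_day18)
fold = ratio_fold_change(wt_day14, mut_day14, wt_day18, mut_day18)
print(f"WT:mutant ratio changed {fold:.2f}-fold, Fisher exact P = {p_value:.2g}")
```

Rows of the table are time points and columns are genotypes, so a small P value indicates that the wild type to mutant proportion differs between the two samplings.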

We find evidence of distinct forms of cooperativity within the 
Bacteroidales of the human intestinal microbiota (Extended Data 
Fig. 10). For B. thetaiotaomicron, amylopectin and levan digestion provide 
mostly private benefits and modest social benefits to other cells. By 
contrast, Bo releases large amounts of inulin digestion products via 
a pair of dedicated cross-feeding secreted enzymes unnecessary for 
its use of inulin. These enzymes allow for cooperation with cross-fed 
species, which provide benefits in return. Potential mechanisms by 
which Bv may provide return benefits to Bo include detoxification of 
inhibitory substances, or production of a depleted or growth promoting 
factor, the latter supported by early growth benefits to Bo via Bv secreted 
factors (Extended Data Fig. 9e). 

Understanding whether microbial communities are formally cooper- 
ative is central to predicting their evolutionary and ecological stability. 
Cooperative systems can be productive, but are prone to instabilities 
on both ecological and evolutionary timescales that can undermine 
them*!*!5?122, The ability of one species to utilize the waste product 
of another is prevalent, but waste production alone does not signify 
cooperative evolution. As opposed to waste product utilization or 
exploitive interactions?® *°, there are few well-documented cases of 
evolved cooperation between microbial species where one species 
evolves to help another'*. We have found evidence of strong eco- 
evolutionary interactions within the microbiota that are likely to be 
central to the functioning of these complex communities. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 


Received 4 May 2015; accepted 24 February 2016. 
Published online 25 April 2016. 


1. Sonnenburg, E. D. et al. Specificity of polysaccharide use in intestinal 
bacteroides species determines diet-induced microbiota alterations. 
Cell 141, 1241-1252 (2010). 

2. West, S. A., Griffin, A. S., Gardner, A. & Diggle, S. P. Social evolution theory for 
microorganisms. Nature Rev. Microbiol. 4, 597-607 (2006). 

3. Rakoff-Nahoum, S., Coyne, M. J. & Comstock, L. E. An ecological network of 
polysaccharide utilization among human intestinal symbionts. Curr. Biol. 
24, 40-49 (2014). 

4. Drescher, K., Nadell, C. D., Stone, H. A., Wingreen, N. S. & Bassler, B. L. 
Solutions to the public goods dilemma in bacterial biofilms. Curr. Biol. 24, 
50-55 (2014). 

5. Frank, S. A. A general model of the public goods dilemma. J. Evol. Biol. 23, 
1245-1250 (2010). 

6. Koropatkin, N. M., Cameron, E. A. & Martens, E. C. How glycan metabolism 
shapes the human gut microbiota. Nature Rev. Microbiol. 10, 323-335 
(2012). 

7. Subramanian, S. et al. Cultivating healthy growth and nutrition through the gut 
microbiota. Cell 161, 36-48 (2015). 

8. Shipman, J. A., Cho, K. H., Siegel, H. A. & Salyers, A. A. Physiological 
characterization of SusG, an outer membrane protein essential for starch 
utilization by Bacteroides thetaiotaomicron. J. Bacteriol. 181, 7206-7211 
(1999). 

9. Littman, D. R. & Pamer, E. G. Role of the commensal microbiota in normal 
and pathogenic host immune responses. Cell Host Microbe 10, 311-323 
(2011). 

10. Waldor, M. K. et al. Where next for microbiome research? PLoS Biol. 13, 
e1002050 (2015). 

11. Koschwanez, J. H., Foster, K. R. & Murray, A. W. Sucrose utilization in budding 
yeast as a model for the origin of undifferentiated multicellularity. PLoS Biol. 9, 
e1001122 (2011). 

12. Costello, E. K., Stagaman, K., Dethlefsen, L., Bohannan, B. J. M. & Relman, D. A. 
The application of ecological theory toward an understanding of the human 
microbiome. Science 336, 1255-1262 (2012). 

13. Estrela, S., Whiteley, M. & Brown, S. P. The demographic determinants of 
human microbiome health. Trends Microbiol. 23, 134-141 (2015). 

14. Mitri, S. & Foster, K. R. The genotypic view of social interactions in microbial 
communities. Annu. Rev. Genet. 47, 247-273 (2013). 

15. Oliveira, N. M., Niehus, R. & Foster, K. R. Evolutionary limits to cooperation 
in microbial communities. Proc. Natl Acad. Sci. USA 111, 17941-17946 
(2014). 

16. Faith, J. J. et al. The long-term stability of the human gut microbiota. Science 
341, 1237439 (2013). 


17. Zitomersky, N. L., Coyne, M. J. & Comstock, L. E. Longitudinal analysis of the 
prevalence, maintenance, and IgA response to species of the order 
Bacteroidales in the human gut. Infect. Immun. 79, 2012-2020 (2011). 

18. Flint, H. J., Bayer, E. A., Rincon, M. T., Lamed, R. & White, B. A. Polysaccharide 
utilization by gut bacteria: potential for new insights from genomic analysis. 
Nature Rev. Microbiol. 6, 121-131 (2008). 

19. Elhenawy, W., Debelyy, M. O. & Feldman, M. F. Preferential packing of acidic 
glycosidases and proteases into Bacteroides outer membrane vesicles. 
mBio 5, e00909-14 (2014). 

20. Cuskin, F. et al. Human gut Bacteroidetes can utilize yeast mannan through 
a selfish mechanism. Nature 517, 165-169 (2015). 

21. Sachs, J. L., Mueller, U. G., Wilcox, T. P. & Bull, J. J. The evolution of cooperation. 
Q. Rev. Biol. 79, 135-160 (2004). 

22. Foster, K. R. & Wenseleers, T. A general model for the evolution of mutualisms. 
J. Evol. Biol. 19, 1283-1293 (2006). 

23. Gore, J., Youk, H. & van Oudenaarden, A. Snowdrift game dynamics and 
facultative cheating in yeast. Nature 459, 253-256 (2009). 

24. Martens, E. C. et al. Recognition and degradation of plant cell wall 
polysaccharides by two human gut symbionts. PLoS Biol. 9, e1001221 
(2011). 

25. Orel, R. & Kamhi Trop, T. Intestinal microbiota, probiotics and prebiotics in 
inflammatory bowel disease. World J. Gastroenterol. 20, 11505-11524 
(2014). 

26. Cameron, E. A. et al. Multifunctional nutrient-binding proteins adapt human 
symbiotic bacteria for glycan competition in the gut by separately promoting 
enhanced sensing and catalysis. mBio 5, e01441-14 (2014). 

27. Momeni, B., Waite, A. J. & Shou, W. Spatial self-organization favors heterotypic 
cooperation over cheating. eLife 2, e00960 (2013). 

28. Ng, K. M. et al. Microbiota-liberated host sugars facilitate post-antibiotic 
expansion of enteric pathogens. Nature 502, 96-99 (2013). 

29. Degnan, P. H., Barry, N. A., Mok, K. C., Taga, M. E. & Goodman, A. L. Human gut 
microbes use multiple transporters to distinguish vitamin B12 analogs and 
compete in the gut. Cell Host Microbe 15, 47-57 (2014). 

30. Fischbach, M. A. & Sonnenburg, J. L. Eating for two: how metabolism 
establishes interspecies interactions in the gut. Cell Host Microbe 10, 336-347 
(2011). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank C. B. Ogbunugafor, J. Ordovas-Montanes, 
U. von Andrian and M. Waldor for suggestions, M. Delaney for SCFA analysis, and 
V. Yeliseyev for assistance with gnotobiotics. Inulin and FOS were provided 
by Beneo-Orafti. Mice were provided by the HDDC, NIH Grant P30 DK34845. 
S.R.-N. is supported by the PIDS-St Jude Research Hospital Fellowship 
Program in Basic Research, a K12 Child Health Research Center grant through 
Boston Children’s Hospital and a Pilot Feasibility Award funded by HDDC P30 
DK034854. K.R.F. is supported by European Research Council Grant 242670. 
This work was supported by Public Health Service grant R01AI081843 
(to L.E.C.) from the NIH/NIAID. 


Author Contributions S.R.-N. performed mutant construction, bacterial cultures, 
gnotobiotic experiments, protein purification and TLC; L.E.C. assisted with 
mutant construction. S.R.-N. and L.E.C. analysed the data. S.R.-N., K.R.F. and 
L.E.C. designed the study and wrote the paper. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial 
interests. Readers are welcome to comment on the online version of the 
paper. Correspondence and requests for materials should be addressed to 
S.R.-N. (seth.rakoff-nahoum@childrens.harvard.edu). 


12 MAY 2016 | VOL 533 | NATURE | 259 


© 2016 Macmillan Publishers Limited. All rights reserved 




METHODS 

Bacterial strains and media. Bacteroidales type strains used in this study are Bo 
ATCC 8483, B. thetaiotaomicron VPI 5482, Bv ATCC 8482 and B. fragilis NCTC 
9343. Bacteria were grown in media formulation as previously described’. 
Bacterial culture. For growth in defined media, bacteria were inoculated from 
brain heart infusion plates containing haemin and vitamin K (BHIS) into 
basal medium (BS), cultured overnight to stationary phase, then diluted 1:10 in 
fresh BS and grown to mid log. At mid log, bacteria were pelleted by centrifugation 
and washed with sterile phosphate buffered saline (PBS) and then inoculated in 
defined media. Carbohydrates used to supplement defined media include fruc- 
tose (F2543, Sigma), fructose oligosaccharides (FOS; OraftiP95, Beneo-Orafti 
group), levan (L8647, Sigma), amylopectin (10120, Sigma), and inulin (OraftiHP, 
Beneo-Orafti group). Levan and amylopectin were autoclaved as 1% w/v in H2O 
and dialysed using 3.5 kDa MW membranes (Slide-A-Lyzer Dialysis Cassettes, 
ThermoScientific). Short chain fatty acids acetate, propionate and succinate were 
purchased from Sigma. Stock solutions of 2 mM were pH neutralized to pH 7.2-7.3 
with 10 N NaOH. All cultures were grown at 37 °C under anaerobic conditions. 
Bacterial growth was quantified by absorbance (A600 nm) using 200 µl of bacterial 
culture in 96-well flat-bottom microtitre plates using a Powerwave 
spectrophotometer (Biotek). Murine gut Bacteroidales from the caecal preparations 
used for the colonization experiments were grown on BHIS plates. Resulting colonies 
were tested for growth in inulin minimal medium or inulin breakdown products 
from the conditioned media of Bo grown in inulin, containing inulin breakdown 
products as previously described’. 

Creation of Bacteroides mutants. Deletion mutants were created whereby the 
genes encoding BT3698 or BT1760 in Bacteroides thetaiotaomicron VPI 5482, 
or BACOVA_04502, BACOVA_04503, BACOVA_04502/3, BACOVA_04504 or 
BACOVA_04505 in Bo ATCC 8483, were removed. DNA segments 
upstream and downstream of the region to be deleted were PCR amplified 
using the primers outlined in Supplementary Table 1. PCR products were digested 
with BamHI, EcoRI and/or MluI engineered into the primers (Supplementary 
Information Table 1) and cloned by three-way ligation into the appropriate site 
of pNJR6 (ref. 31). The resulting plasmids were conjugally transferred into the 
Bacteroides strain as indicated using helper plasmid R751 and cointegrates were 
selected by erythromycin resistance. Cross outs were screened by PCR for the 
mutant genotype. 

Cloning of PUL genes for expression in deletion mutants. BACOVA_04502, 
BACOVA_04503, BACOVA_04502/3, BACOVA_04504, BACOVA_04505 
or BACOVA_04504/5 genes were PCR amplified using the primers listed in 
Supplementary Table 1. The PCR products were digested and ligated into the 
BamHI or KpnI site of the Bacteroides expression vector pFD340. Plasmids 
containing the correct orientation of the insert in relation to the vector-borne 
promoter were introduced into mutant Bacteroides strains by mobilization from 
E. coli using helper plasmid RK231. 

Mono-, co- and tri-culture experiments. For bacterial mono- and co-culture 
experiments in defined liquid media, bacteria were grown as indicated for mono- 
culture before addition to the defined media. Sterile magnetic stir bars were added 
to culture tubes within a rack placed on a stir plate within the anaerobic chamber. 
For conditioned media experiments, Bo WT, Bo Δ04502, Bo Δ04502/3, Bv or Bf 
were grown to early log in inulin defined media or 0.125% fructose defined media 
(for Extended Data Fig. 9e); conditioned media were collected, filter sterilized, and 
incubated at 37 °C for 72 h. Conditioned media were replenished with defined media 
without additional carbohydrate and used for cultivation of Bv. Bo Δ04502/3 
conditioned media were dialysed in defined media without carbohydrate using 2 kDa 
MW membranes (Slide-A-Lyzer Dialysis Cassettes, ThermoScientific). For 
monoculture of Bo WT and mutants on solid agarose, 4 µl of the indicated concentration 
of bacteria were dotted onto minimal inulin agarose plates. At the indicated time 
points, the bacteria were cut out, diluted and plated onto BHIS for CFU enumera- 
tion. For co-plating experiments, 10° Bo (WT or mutant) or Bf were co-plated with 
10° Bv or a control volume of PBS. Four µl were then dotted onto defined inulin 
agarose plates. At the indicated time points, the dotted patches were cut from the 
agarose plates and resuspended in PBS, diluted and plated to BHIS for enumera- 
tion. Quantification and differentiation of wild type and isogenic glycoside hydro- 
lase or polysaccharide lyase mutant was performed by plating dilutions of mixed 
liquid culture or the cut-out patch on agarose plates onto BHIS, followed by picking 
~100 colonies and determining wild type or mutant genotype by PCR using prim- 
ers listed in Supplementary Table 1. For genotypic screening of B. thetaiotaomicron 
wild type and B. thetaiotaomicron Δ3698, two sets of primers were used 
(Supplementary Table 1). For Bo/Bv co-culture in Fig. 4b, c and Extended Data 
Fig. 7, Bo Δ03533 (WT) and Bo Δ03533 Δ04502 (Δ04502) arginine auxotrophic 
mutants were used in co-culture with Bv on minimal inulin agarose plates 
supplemented with 50 µg ml−1 of arginine (Sigma), which does not impair or limit 
growth as compared to wild type. Colonies on BHIS plates were replica plated 
onto defined glucose plates, which support the growth of Bv but not Bo 
Δ03533 or Bo Δ03533 Δ04502. 
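
The enumeration steps above (dilute, plate, count colonies, then genotype a sample of colonies by PCR) reduce to simple arithmetic. The following is a minimal sketch of that arithmetic, not the authors' own analysis code; the function names and all numbers in the example are hypothetical and chosen only for illustration.

```python
import math


def cfu_per_ml(colonies, dilution_factor, plated_volume_ml):
    """Back-calculate CFU per ml of the undiluted suspension.

    colonies         -- colonies counted on the plate
    dilution_factor  -- total fold dilution before plating (e.g. 1e4)
    plated_volume_ml -- volume spread on the plate, in ml
    """
    if colonies < 0 or dilution_factor <= 0 or plated_volume_ml <= 0:
        raise ValueError("colonies must be >= 0; dilution and volume must be > 0")
    return colonies * dilution_factor / plated_volume_ml


def genotype_split(total_cfu_per_ml, n_wild_type, n_mutant):
    """Apportion total CFU between wild type and mutant.

    Uses the genotype fractions found among the colonies that were
    screened (about 100 colonies per sample in the text above).
    """
    screened = n_wild_type + n_mutant
    if screened == 0:
        raise ValueError("at least one colony must be genotyped")
    wt_fraction = n_wild_type / screened
    return total_cfu_per_ml * wt_fraction, total_cfu_per_ml * (1 - wt_fraction)


if __name__ == "__main__":
    # Hypothetical plate: 42 colonies from 0.1 ml of a 10^4-fold dilution,
    # of which 30 of 100 screened colonies carried the wild-type genotype.
    total = cfu_per_ml(colonies=42, dilution_factor=1e4, plated_volume_ml=0.1)
    wt, mutant = genotype_split(total, n_wild_type=30, n_mutant=70)
    print(f"total {total:.2e}, wild type {wt:.2e}, mutant {mutant:.2e} CFU per ml")
```

Because only a sample of colonies is genotyped, the wild-type and mutant estimates carry sampling error that the total count does not.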

Thin-layer chromatography. Thin-layer chromatography (TLC) was employed 
to specifically detect carbohydrates as previously described*. Standards for TLC 
included glucose (G7528, Sigma), fructose (F2543, Sigma), fructose oligosaccha- 
rides (FOS; OraftiP95, Beneo-Orafti group) and inulin (OraftiHP, Beneo-Orafti 
group). See Supplementary Information Fig. 1 for uncropped TLCs. 

Gas chromatographic analysis of culture media. Chromatographic analysis was 
carried out using a Shimadzu GC14-A system with a flame ionization detector 
(FID) (Shimadzu Corp, Kyoto, Japan). A volatile acid mix containing 10 mM of 
acetic, propionic, isobutyric, butyric, isovaleric, valeric, isocaproic, caproic, and 
heptanoic acids was used (Matreya, Pleasant Gap PA). A non-volatile acid mix 
containing 10mM of pyruvic and lactic acid and 5 mM of oxaloacetic, oxalic, methyl 
malonic, malonic, fumaric, and succinic acid was used (Matreya, Pleasant Gap PA). 
Cloning, purification, and enzymatic analysis of BACOVA_04502-3. To obtain 
purified BACOVA_04502 and BACOVA_04503 proteins, these genes were cloned 
individually into the BamHI site of pET16b (Novagen) using the primers listed in 
Supplementary Table 1. The constructs were designed so that the His-tag encoded 
by pET16b replaced the SpII signal sequence of these proteins allowing for their 
solubility. The recombinant plasmids were transformed into E. coli BL21 (DE3), 
grown to an A600 nm of 0.6-0.7, and expression of the recombinant gene was induced 
by the addition of 0.4 mM IPTG for an additional 4 h. The His-tagged proteins were 
isolated essentially as described using Dynabeads TALON paramagnetic beads. 
For enzymatic analysis (Fig. 2a) the proteins were added to inulin media in their 
magnetic bead-bound form. This allows for easy removal of the enzymes following 
digestion. As a control, the beads resulting from the same procedure performed 
with E. coli BL21 (DE3) containing only the vector (pET16b) were used. For Bo 
growth assays (Fig. 3d), 50 µl of beads (25 µl containing His-04502 and 25 µl 
containing His-04503) or equivalent volume of Dynabead buffer (for undigested inulin) 
were added to inulin defined medium. After 24 h at 37 °C, the beads containing 
the enzyme were removed with a magnet, and the media (digested or undigested 
inulin) was used to culture Bo WT and Δ04502. 

Gnotobiotic mouse experiments. All animal experiments were approved by the 
Harvard Medical School IACUC. Swiss Webster germ-free male mice (6-10 weeks 
old) were purchased from the Harvard Digestive Diseases Gnotobiotic Core facility. 
Littermates were randomly allocated for different gnotobiotic experimental arms. 
Experiments were conducted in either sterile Optima cages (Figs 3b, 4a, Extended 
Data Fig. 9d) or gnotobiotic isolators (Fig. 4f). When appropriate, animals were 
numbered by sterile tail markers for longitudinal analysis and were not blinded. 
Number of animals used for experimentation was determined by precedence 
for gnotobiotic studies. Longitudinal analysis in the experiment of Fig. 4f of the 
ratios of wild type to mutant Bo under changing environmental conditions allowed 
for internally-controlled, within-individual comparisons. Mice were placed on 
polysaccharide-free special chow (65% w/v glucose, protein-free, supplemented 
with all essential amino acids except arginine; BioServe). Arginine at 50 µg ml−1 
was supplemented in all drinking water. As indicated, mice were given 1% inulin 
(w/v) in sterile drinking water. Mice were inoculated with the indicated bacteria 
by applying ~10⁸ live bacteria (grown to mid log) onto mouse fur. Dilutions of 
faeces at various time points following inoculation were plated to BHIS plates and 
genotyped as above (Fig. 4f). The predetermined exclusion criterion for gnotobiotic 
experiments was contamination, as determined by either the presence of colonies with 
distinct morphology on anaerobic plates than Bo or Bv or colonies present at >10? 
CFU ml"! (limit of detection) under aerobic conditions. 

For gavage of Bo WT or Δ04502/3 monocolonized mice with the caecal content 
of conventionally raised mice, two ~8-week-old male Swiss Webster mice 
purchased from Taconic and housed in a specific pathogen-free facility were 
euthanized under sterile conditions. The intestine was excised and care was taken to 
leave caecum intact. Within 2 min of excision, the caeca were transferred into 
an anaerobic chamber where the caecal contents were pooled and diluted with 
~10 ml of pre-reduced phosphate buffered solution supplemented with 0.1% 
cysteine. Contents were vigorously vortexed and immediately divided equally into 
two conical flasks (one for each group of mouse) to ensure equality of suspension 
between both conical flasks. Tightly sealed conical flasks were transferred to the 
mouse facility at which point 200 µl were gavaged to each mouse housed in sterile 
Optima cages. Caecal contents were plated to LKV (Remel) plates for enrichment 
of Bacteroides. Different colony morphologies were assigned species by 16S PCR 
as previously described'’. For each group of mice, mice were housed in cages of 
2 and 3 mice per cage. At time of killing, caecal contents were collected and plated 
to defined inulin plates by which Bo was enumerated based on distinct colony 
morphology of Bo on inulin agarose plates that was not present in conventionally 
raised caecal population. 
Statistical analysis. The experiments were not randomized, and the investigators 
were not blinded to allocation during experiments and outcome assessment. No 
statistical methods were used to predetermine sample size. Replicate experiments 
are shown in Extended Data. All P values are derived from Student's t-test except 
as indicated in Fig. 4f and Extended Data Fig. 9c where Fisher’s exact test was 
performed. Statistical significance of variance reported as indicated per experi- 
ment in figure legends. All centre values are mean. Error bars are standard error 
of the mean. 
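
The quantities named above (mean, standard error of the mean, Student's t-test, Fisher's exact test) can be sketched with the Python standard library alone. This is an illustrative sketch under stated assumptions, not the authors' analysis: the function names are hypothetical, the t-test helper returns only the pooled-variance statistic and degrees of freedom (a P value would additionally require the t distribution, which the standard library does not provide), and the two-sided Fisher P value uses the common convention of summing all tables no more probable than the observed one.

```python
from math import comb, sqrt
from statistics import mean, stdev


def sem(values):
    """Standard error of the mean: sample s.d. divided by sqrt(n)."""
    return stdev(values) / sqrt(len(values))


def student_t(a, b):
    """Pooled-variance (Student's) two-sample t statistic and degrees of freedom."""
    na, nb = len(a), len(b)
    pooled_var = ((na - 1) * stdev(a) ** 2 + (nb - 1) * stdev(b) ** 2) / (na + nb - 2)
    t = (mean(a) - mean(b)) / sqrt(pooled_var * (1 / na + 1 / nb))
    return t, na + nb - 2


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact P value for the 2x2 table [[a, b], [c, d]].

    For genotype counts, a and b could be wild-type and mutant colonies
    before an intervention, and c and d the same counts afterwards.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Small relative tolerance guards against floating-point ties.
    return sum(prob(x) for x in range(lowest, highest + 1)
               if prob(x) <= p_observed * (1 + 1e-9))


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    print(sem([1.0, 2.0, 3.0]))
    print(student_t([1.0, 2.0, 3.0], [2.0, 3.0, 4.0]))
    print(fisher_exact_two_sided(3, 1, 1, 3))
```

With only two or three replicates per condition, as in many panels here, the standard error is itself a rough estimate, which is one reason exact tests on colony counts are a natural choice for the genotype-ratio comparisons.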
Extended Data Figure 1 | Limited cooperation during polysaccharide 
utilization by B. thetaiotaomicron. a–d, Independent experiments for 
Fig. 1b–e. a, b, Upper panels n = 2 biological replicates, lower panels 
n = 1; c, d, n = 2 biological replicates. e, TLC analysis of conditioned 
media from B. thetaiotaomicron grown in amylopectin (left panel) or 
levan (right panel) minimal media. EL, early log; ML, mid log; LL, late 
log; Stat, stationary phase; Glu, glucose; Fru, fructose; Suc, sucrose. 
See Supplementary Information Fig. 1 for uncropped scanned images. 
In all panels, error bars represent standard error; P values derived from 
two-tailed Student’s t-test. 
Extended Data Figure 2 | Bo Δ04502 and Δ04503 mutants grow 
equivalently to wild type on limiting concentrations of inulin and do 
not require surface digestion for utilization of inulin. a, Growth of 
Bo WT, Δ04502, Δ04503 and Δ04502/3 mutants in varying concentrations 
of inulin as indicated. Biological replicates of each condition are plotted 
as individual lines (n = 2 cell culture biological replicates). Upper and 
lower panels are independent experiments. b, Independent experiments 
for Fig. 2d. EL, early log; ML, mid log; LL, late log; St, stationary phase. 
See Supplementary Information Fig. 1 for uncropped scanned images. 
c, Complementation of Bo Δ04502 and Δ04502/3 mutants with the 
respective genes in trans. TLC analysis of conditioned media from 
Bo Δ04502/3 (left panel) complemented in trans with BACOVA_04502, 
BACOVA_04503, BACOVA_04502/3 or vector alone (pFD340) and 
Bo Δ04502 (right panel) with BACOVA_04502 or vector alone grown 
in defined inulin media. See Supplementary Information Fig. 1 for 
uncropped scanned images. S, stationary phase. 
Extended Data Figure 3 | SusC and SusD homologues BACOVA_04505 
and BACOVA_04504 are required for inulin utilization. a, Independent 
experiments for Fig. 2e. Left panel n = 2 biological replicates, right panel 
each line represents n = 1 sample per condition. b, Complementation of 
Bo Δ04504, Bo Δ04505 and Δ04504/5 mutants with the genes in trans. 
Growth of Bo Δ04504, Bo Δ04505, Bo Δ04504/5 with BACOVA_04504, 
BACOVA_04505, BACOVA_04504/5 or vector alone (pFD340) in trans in 
defined inulin media. Each line represents n = 1 sample per condition. In 
all panels, error bars represent standard error; P values derived from 
two-tailed Student’s t-test. 
Extended Data Figure 4 | Costs of extracellular inulin digestion by 
B. ovatus. a, b, Independent experiments for Fig. 3a, c. n = 3 biological 
replicates at day 1, 2; n = 2 biological replicates at day 4 (a). In upper and 
upper middle panels biological replicates of each condition are plotted as 
individual lines (n = 2 cell culture biological replicates); in lower middle 
and lower panels each line represents n = 1 sample per condition (b). 
c, TLC analysis of conditioned media from Bo WT and Bo Δ04502/3 
cultured in 0.5% inulin with trace (0.06%) amounts of fructose. 
St, stationary phase. See Supplementary Information Fig. 1 for uncropped 
scanned images. In all panels, error bars represent standard error; P values 
derived from two-tailed Student's t-test. 
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of Fig. 3d. b, Additional independent experiments for Fig. 3d. Each line independent experiment. Each condition is plotted as individual lines in 
Extended Data Figure 6 | A complex mouse microbiota differentially 
affects B. ovatus WT and Δ04502/3 pre-colonized gnotobiotic mice 
and analysis of cross feeding of the predominant murine Bacteroidales 
of the murine gut microbiota. a, Germ-free mice were monocolonized 
with Bo WT or Δ04502/3 and maintained on a diet supplemented with 
inulin as the sole polysaccharide and housed under gnotobiotic conditions 
for 2 weeks. Bacteria were enumerated from faeces before gavage with 
intestinal microbiota of conventionally raised mice (n =5 mice, cell culture 
biological replicates). b, Growth of two dominant mouse microbiota 
Bacteroidales strains (Parabacteroides goldsteinii and a strain with 96% 
16S rRNA gene identity to B. thetaiotaomicron) with inulin breakdown 
products derived from the conditioned media of B. ovatus grown in inulin 
(all inulin had been digested) or undigested inulin minimal media. Each 
data point is a different isolate of the indicated species from the caeca of 
the conventionally raised mice used for gavage (n = 11 isolates for upper 
panel, n =6 for lower panel). In all panels, error bars represent standard error. 
Extended Data Figure 7 | B. ovatus, but not B. fragilis, benefits 
from B. vulgatus in co-culture in inulin. a, b, Independent experiments 
for Fig. 4b, d. The left panel corresponds to starting culture with ~107 
CFU Bo corresponding to starting culture of ~10° CFU Bo in Fig. 4d. 

The two right panels are a duplicate pair of experiments of starting CFU 
of 10° and 10’. In each panel n =2 biological replicates. c, Enumeration of 
B. fragilis in monoculture or co-culture with Bv on defined inulin plates, 
n = 3 cell culture biological replicates. Letters in parentheses refer to values 
correlating to colour of line used for statistical analysis. In a, for example, 
p (g,r) refers to comparison of green (g) and red (r) values at 
the time-point indicated. In b, colour of P value indicates comparison 
of monoculture and co-culture for the given condition at the time-point 
indicated. For all panels, error bars represent standard error; P values 
derived from two-tailed Student's t-test. 
Extended Data Figure 8 | Secreted factors from Bo and isogenic 
mutants, but not B. fragilis (Bf), support Bv survival. a, Growth of 
Bv in conditioned media from Bo WT, Δ04502, Δ04502/3 or Bf grown in 
defined media with inulin as the sole carbohydrate and in inulin media. 
End time-point corresponded to peak growth of Bv in conditioned media 
derived from Bo WT. B. fragilis, which utilizes inulin but, similar to 
Bo Δ04502/3, does not liberate inulin breakdown products, does not 
support the survival of Bv during co-culture. Left and right panels are 
independent experiments. Left panel: start, n = 4 biological replicates; 
end, n = 2 biological replicates. Right panel: t = 0, n = 4 cell culture biological 
replicates; t = 13 and 24 h, n = 2 biological replicates. b, Growth of Bv in 
dialysed (2 kDa MW membrane) or undialysed conditioned media from 
Bo Δ04502/3 grown in inulin, n = 2 cell culture biological replicates. 
c, Gas chromatographic analysis of acetate, propionate and succinate in 
conditioned media during growth of Bo WT and Δ04502/3 in defined 
media with inulin as the sole carbohydrate. Other volatile and non-volatile 
substances (as listed in Methods) were undetectable, n = 2 cell culture 
biological replicates, except Δ04502/3 stationary phase, n = 1. d, Growth 
of Bv in defined medium with inulin as the sole carbohydrate with or 
without addition of 15 mM of acetate, propionate or succinate; t = 0, 
n = 4 biological replicates; t = 13 and 24 h, n = 2 biological replicates. 
CM, conditioned media; UD, undetected. For all panels, error bars 
represent standard error; P values derived from two-tailed Student's t-test. 
Extended Data Figure 9 | Spatial aspects of mutualism between Bo and 
Bv via cross feeding and interspecies cooperation in vivo via 04502/3. 
a, Independent experiments for Fig. 4c, n = 4 biological replicates. 
b, Independent experiments for Fig. 4e, n = 3 cell culture biological 
replicates. c, Scatter plot of experiment in Fig. 4f, n = 3 cell culture 
biological replicates. d, Ratios of wild type and Δ04502/3 in faeces 21 
days after co-colonization of three germ-free mice (n = 3 mice biological 
replicates) on a diet of inulin as the sole dietary polysaccharide (pre-Bv) 
and then four days after introduction of Bv (post-Bv). Each panel shows 
the ratio pre- and post-Bv of an individual mouse. P values are from Fisher's 
exact test comparing the frequency of Bo WT and Δ04502/3 pre- and 
post-colonization with Bv for each individual mouse. At day 21, all 
mice were colonized with a higher ratio of the mutant (ranging to 86% 
in the mouse depicted in the lowest panel), with each mouse showing a 
statistically significant increase in the proportion of the wild type after 
introduction of Bv. e, Growth of B. ovatus in 0.5% inulin (upper panel) 
or fructose (lower panel) in minimal media to which filter sterilized 
conditioned media from early log, A600 nm matched growth of Bv or Bo in 
0.125% fructose minimal media or fresh 0.125% fructose minimal media 
control was added at 1:1 ratio. In e, numbers refer to P values (*P < 0.05, 
**P < 0.01, ***P < 0.001) of comparison of green (g), red (r) or 
black (b) values by unpaired, two-tailed Student's t-test at the time-point 
indicated, n = 2 cell culture biological replicates. For all panels, error bars 
represent standard error; for all panels except d, P values derived from 
two-tailed Student's t-test. 
Extended Data Figure 10 | Schematic of forms of cooperativity via 
polysaccharide digestion among Bacteroidales. a, Limited cooperation. 
Privatization of extracellularly digested public goods by the individual 
performing the digestion leads to greater individual (thick arrow) than 
shared benefits (thin arrow) as seen in Bt during growth on levan and 
amylopectin. b, Cooperation between species is seen between Bo and Bv 
during growth on inulin. Surface digestion of inulin by Bo creates 
breakdown products that it does not need to grow on inulin. Rather, inulin 
breakdown represents a dedicated cross-feeding system that provides 
benefits to Bv, with reciprocal fitness benefits to Bo. 
Molecular mechanism of APC/C activation by mitotic phosphorylation 


Suyang Zhang, Leifu Chang, Claudio Alfieri, Ziguo Zhang, Jing Yang, Sarah Maslen, Mark Skehel & David Barford 


In eukaryotes, the anaphase-promoting complex (APC/C, also 
known as the cyclosome) regulates the ubiquitin-dependent 
proteolysis of specific cell-cycle proteins to coordinate chromosome 
segregation in mitosis and entry into the G1 phase. The catalytic 
activity of the APC/C and its ability to specify the destruction of 
particular proteins at different phases of the cell cycle are controlled 
by its interaction with two structurally related coactivator subunits, 
Cdc20 and Cdh1. Coactivators recognize substrate degrons, and 
enhance the affinity of the APC/C for its cognate E2 (refs 4-6). 
During mitosis, cyclin-dependent kinase (Cdk) and polo-like 
kinase (Plk) control Cdc20- and Cdh1-mediated activation of 
the APC/C. Hyperphosphorylation of APC/C subunits, notably 
Apc1 and Apc3, is required for Cdc20 to activate the APC/C, 
whereas phosphorylation of Cdh1 prevents its association with the 
APC/C. Since both coactivators associate with the APC/C 
through their common C-box and Ile-Arg tail motifs, the 
mechanism underlying this differential regulation is unclear, as 
is the role of specific APC/C phosphorylation sites. Here, using 
cryo-electron microscopy and biochemical analysis, we define the 
molecular basis of how phosphorylation of human APC/C allows 
for its control by Cdc20. An auto-inhibitory segment of Apc1 acts as 
a molecular switch that in apo unphosphorylated APC/C interacts 
with the C-box binding site and obstructs engagement of Cdc20. 
Phosphorylation of the auto-inhibitory segment displaces it from 
the C-box-binding site. Efficient phosphorylation of the 
auto-inhibitory segment, and thus relief of auto-inhibition, requires 
the recruitment of Cdk-cyclin in complex with a Cdk regulatory 
subunit (Cks) to a hyperphosphorylated loop of Apc3. We also find 
that the small-molecule inhibitor, tosyl-L-arginine methyl ester, 
preferentially suppresses APC/C^Cdc20 rather than APC/C^Cdh1, and 
interacts with the binding sites of both the C-box and Ile-Arg tail 
motifs. Our results reveal the mechanism for the regulation of 
mitotic APC/C by phosphorylation and provide a rationale for the 
development of selective inhibitors of this state. 

To understand how multi-site phosphorylation of numerous APC/C 
subunits stimulates the capacity of Cdc20 to control the APC/C we 
determined a series of APC/C structures in different functional states to 
near-atomic resolution (Extended Data Table 1a). We used the kinases 
Cdk2-cyclin A3-Cks2 and Polo (Plk1) to phosphorylate in vitro 
recombinant human APC/C (Extended Data Fig. 1a), obtaining 
APC/C in the mitotic state that can be activated by Cdc20 (Extended 
Data Fig. 1b, lanes 9, 10). This reconstituted APC/C recapitulates Cdk- 
and Plk1-dependent activation of endogenous APC/C^Cdc20 (refs 8-12). 
Kinase treatment resulted in a complete upshift of the Apc3 subunit 
as visualized on SDS-PAGE, indicative of stoichiometric 
phosphorylation (Extended Data Fig. 1a, c). Almost 150 phosphorylation sites 
were identified in phospho-APC/C by mass spectrometry (Extended 
Data Tables 2 and 3), matching published data. These sites lie 
within disordered regions of the APC/C. Incubating the APC/C with 
both Cdk2-cyclin A3-Cks2 and Plk1 simultaneously was necessary to 
obtain full activation (Extended Data Fig. 1b). Consistent with ref. 12, 
treatment with Cdk2-cyclin A3-Cks2 alone resulted in lower APC/C 
activation, whereas phosphorylation with Plk1 alone did not activate 
the APC/C. 

To gain insights into the molecular interactions between Cdc20 and 
mitotic APC/C, a ternary complex was assembled using phosphorylated 
APC/C, Cdc20 and a high-affinity substrate Hsl1 (APC/C^Cdc20-Hsl1) 
(Extended Data Fig. 1d) for cryo-EM analysis (Fig. 1a, b, Extended Data 
Fig. 1e-g and Extended Data Table 1). Owing to the low occupancy of 
Cdc20 bound to the APC/C, combined with conformational 
heterogeneity, only 9% of the particles were used for the final reconstruction 
of APC/C^Cdc20-Hsl1, with the remainder being in either the apo state 
(72%) or in a hybrid state (Extended Data Fig. 2). Most static regions 
of the complex extend beyond 3.9 Å resolution, whereas the catalytic 
module (Apc2 and Apc11), as well as the WD40 domain of Cdc20, are 
more flexible (Extended Data Fig. 3a). 

APC/C^Cdc20-Hsl1 adopts an active conformation with the catalytic 
module in the upward position reminiscent of APC/C^Cdh1-Hsl1 (ref. 6) 
(Fig. 1a, b, Extended Data Fig. 4a, b and Supplementary Video 1) and 
in agreement with a low-resolution negative-stain EM reconstruction 
of APC/C^Cdc20 (ref. 24). Relative to Cdh1^WD40, the Cdc20^WD40 domain 
is shifted away from the APC/C by as much as 10 Å (Extended Data 
Fig. 3b). Its interaction with the APC/C involves only its N-terminal 
domain (Cdc20^NTD) and the C-terminal Ile-Arg (IR) tail (Cdc20^IR) 
(Fig. 1c-e, Extended Data Figs 3c-e and 4c, d). Compared with 
Cdh1^NTD, Cdc20^NTD forms fewer contacts with both Apc1 and Apc8B 
(Fig. 1c-e). However, the crucial C-box motif (DRYIPxR) 
represents a structurally conserved region common to both coactivators 
(Fig. 1d, Extended Data Figs 3c, d and 4d). Cdc20^C-box forms a network 
of electrostatic interactions with Apc8B, centred on the crucial Arg78 
(ref. 23), and augmented by non-polar interactions involving its 
Tyr79 and Ile80 residues (Fig. 1d). A KILR motif also present within 
Cdc20^NTD is essential for Cdc20 association with the APC/C (ref. 25) 
and the APC/C^Cdc20-Hsl1 structure reveals that Ile130 and Leu131 of 
Cdc20^KILR are inserted into a hydrophobic pocket of the TPR 
(tetratricopeptide repeat) superhelix of Apc8B, further stabilizing the 
conformation of the C box (Fig. 1e). Similar C-box stabilization is present 
in Cdh1^NTD, but is instead provided by a loop structurally unrelated 
to Cdc20^KILR (Fig. 1c). By contrast, the KLLR motif of Cdh1^NTD is 
located in a leucine-zipper-like α-helix (α3) that forms a hydrophobic 
interface with Apc1 (ref. 23) (Fig. 1c and Extended Data Fig. 4d). The 
absence of an equivalent to the Cdh1 α3 helix in Cdc20 suggests a 
weaker mode of binding of Cdc20^NTD relative to Cdh1^NTD. 

EM density for Cdc20^IR is weaker than for Cdh1^IR and it lacks the 
associated α-helix of Cdh1^IR (Extended Data Fig. 3e). This could account 
for the lower affinity of the APC/C for Cdc20^IR compared with Cdh1^IR 
(ref. 17). Nonetheless, the crucial Ile-Arg interaction of Cdc20^IR with 
the TPR superhelix of Apc3A is conserved between the two coactivators 
(Extended Data Fig. 3e). Importantly, because in the APC/C^Cdc20-Hsl1 
EM structure, densities corresponding to phosphorylated residues 
Figure 1 | EM reconstructions of the APC/C^Cdc20-Hsl1 
complex and comparison of 
Cdc20^NTD and Cdh1^NTD. a, b, Two views of 
APC/C^Cdc20-Hsl1 shown in cartoon with the 
D-box and Cdc20^IR highlighted in surface 
representation. Cdc20 binds to the APC/C in 
juxtaposition to Apc10 to form the substrate 
recognition module. Apc1 is composed of an 
N-terminal WD40 domain Apc1^WD40, a middle 
domain Apc1^Mid and a C-terminal Apc1^PC 
domain. Apc11 is modelled based on the 
APC/C^Cdh1-Emi1 structure (PDB 4UI9). c, Both 
Cdc20^NTD (purple) and Cdh1^NTD (grey, aligned 
to APC/C^Cdc20-Hsl1) interact with Apc1 and 
Apc8B, whereas Cdh1^NTD contains an additional 
α3 helix associating with Apc1. d, The crucial 
C-box motif is well conserved between the two 
coactivators and forms extensive interactions 
with Apc8B. e, The KLLR motif of Cdh1 is 
present in the α3 helix to engage Apc1, whereas 
the related Cdc20^KILR motif contacts Apc8B to 
augment C-box binding. 
are not visible, we find no evidence that phosphorylated regions of 
the APC/C either directly or indirectly contact Cdc20. This suggested 
that APC/C phosphorylation invokes a conformational change of apo 
APC/C that promotes its association with Cdc20. 

To explore this possibility, we determined cryo-EM structures of apo 
APC/C in both the unphosphorylated and phosphorylated states at 
near-atomic resolution (Fig. 2a, b, Extended Data Fig. 5a, b, Extended 
Data Table 1 and Supplementary Video 1). In both states, the catalytic 
module adopts an inactive conformation (Extended Data Fig. 4a, b) as 
seen in the previous 8 Å resolution reconstruction. However, 
three-dimensional classification of the atomic resolution EM maps of both 
apo states showed that the majority of Apc3A adopts a closed 
conformation resembling the Apc3 crystal structure in which an α-helix 
(TPR12A) occupies and blocks the IR tail-binding pocket (Extended 
Data Fig. 5c). In ~30% of particles Apc3A adopts an open 
conformation identical to the IR tail-bound state. Thus inter-conversion 
of Apc3A between closed and open IR tail-accessible states is not 
controlled by phosphorylation. 

Phosphorylated and unphosphorylated apo APC/C EM maps are 
very similar in structure (Extended Data Fig. 5a, b), except for a nota- 
ble difference in the region of the C-box binding site (Fig. 2c-f). In 
unphosphorylated APC/C, an unassigned segment of EM density of 
~15 residues indicative of an elongated loop connected to a short 
α-helix is located at the C-box binding pocket of Apc8B (Fig. 2c-e).
The equivalent EM density is not present in phosphorylated APC/C
(Fig. 2f). The WD40 domain of Apc1, positioned in close proximity to
this density, incorporates two highly phosphorylated regions (residues
307-395 (300s loop) and residues 515-579 (500s loop)) (Fig. 2c, d and
Extended Data Table 2), which have not previously been assigned in
either APC/C^Cdc20-Hsl1 or APC/C^Cdh1-Emi1 structures”. The 300s loop
would be predicted to project towards the C-box binding site of Apc8B 
(Fig. 2c, d and Extended Data Fig. 5d), implicating it as a candidate for 
the unassigned density segment. 

We determined the structure of an APC/C mutant with the 300s loop
of Apc1 deleted (APC/C^ΔApc1-300s) (Fig. 2g and Extended Data Table 1a).
In this structure, the C-box binding pocket of APC/C^ΔApc1-300s is devoid
of EM density even without in vitro phosphorylation (Fig. 2g), consistent
with assignment of this density to the 300s loop. Furthermore,
ubiquitination assays showed that APC/C^ΔApc1-300s was constitutively
activated by Cdc20 and that phosphorylation did not enhance its activity
(Fig. 3a, compare lanes 6, 7 to 8, 9). This indicates that unphosphorylated
APC/C is maintained in an auto-inhibited conformation by an Apc1
auto-inhibitory segment, present within the 300s loop, that sterically
impedes the C-box site from binding Cdc20. In support of this idea,
analytical size-exclusion chromatography showed that Cdc20 forms a
binary complex with phosphorylated APC/C, but not with unphosphorylated
APC/C (Extended Data Fig. 6a), in agreement with refs 10,12.

To identify the auto-inhibitory segment within the Apc1 300s loop,
we synthesized a set of eight overlapping peptides of 20 residues,
spanning the 300s loop, and tested their potential to inhibit
APC/C^ΔApc1-300s (Fig. 3b). Notably, peptide 7 (residues 361-380,
Fig. 3b, lane 9) potently suppressed Cdc20-dependent APC/C^ΔApc1-300s
activity. Interestingly, an Arg368-Phe369 pair of this peptide, which
resembles the Arg78-Tyr79 of the C box, is flanked by four serine
residues phosphorylated in mitotic
APC/C (Fig. 3c and Extended Data Table 2)!27, 

EM density for the auto-inhibitory segment is weak, probably due
to partial occupancy at the C-box site (Fig. 3c). Nevertheless, side
chain density similar to Arg78-Tyr79 of the C box suggests a fit for the
Arg-Phe of peptide 7 (Arg368-Phe369) (Fig. 3c). To test the possibility
that the Apc1 auto-inhibitory segment corresponds to peptide 7,
we synthesized mutants of peptide 7 and also introduced the equivalent
mutations into Apc1 of the recombinant APC/C. Replacing
Arg368 with Glu in Apc1 resulted in a Cdc20-dependent activation
of unphosphorylated APC/C (Fig. 3d), and reduced the inhibitory
potency of peptide 7 towards APC/C^ΔApc1-300s (Extended Data Fig. 6c).
A similar result was obtained on substituting glutamates for the four
neighbouring serine residues (Ser364, Ser372, Ser373 and Ser377
(APC/C^Apc1-4S/E)) to mimic phosphorylation (Fig. 3d and Extended
Data Fig. 6c). Phosphorylation of Ser377 of peptide 7 relieved the
inhibition only partially. These findings suggest that Arg368 anchors the
Apc1 auto-inhibitory segment to the C-box binding site, mimicking
the Cdc20 C box, and maintaining the apo APC/C in an auto-inhibited
state. Phosphorylation of the four neighbouring serine residues would
destabilize its association with the C-box site (Figs 3c and 4a). This
mechanism could be further tested in an in vivo context by cellular assays.

Our results so far reveal that the critical determinant of APC/C^Cdc20
activation by phosphorylation is displacement of the Apc1
auto-inhibitory segment from the C-box site. However, since Apc3 is
hyperphosphorylated in mitosis, and Cks stimulates both Cdk-dependent
activation of APC/C^Cdc20 (refs 18,19) and Apc1 and Apc3
phosphorylation!’, and interacts with Apc3 (refs 18,27,28), we tested
whether Apc3 phosphorylation also had a role in APC/C activation. In
Apc3 about 50 phosphorylation sites are clustered in a large disordered
loop comprising residues 180-450 (Extended Data Table 2) located on
the same face of the APC/C as the Apc1 300s loop (Fig. 2c). In contrast
to the Apc1 300s loop deletion (APC/C^ΔApc1-300s), deletion of the Apc3
loop (APC/C^ΔApc3-loop) did not stimulate unphosphorylated APC/C but
instead reduced the phosphorylation-mediated activation of APC/C
(Fig. 3a, lanes 10-13). Similarly, Cdk2-cyclin A3 failed to stimulate
the APC/C activity in the absence of Cks2 (Extended Data Fig. 1b).
However, combining deletions of both the Apc3 and Apc1 300s loops
(APC/C^ΔApc1-300s;ΔApc3-loop) restored activity to that of wild-type
phosphorylated APC/C (Fig. 3a, lanes 14, 15). Since deletion of the
Apc3 loop disrupts APC/C association with Cdk-cyclin-Cks (Extended
Data Fig. 1c, lanes 6, 8), a likely explanation for our results and for
the lag phase that accompanies APC/C activation by Cdk1-cyclin B-Cks”,
is that Apc3 phosphorylation recruits Cdk-cyclin-Cks through Cks
(refs 18,27,28) to stimulate Apc1 auto-inhibitory segment
phosphorylation. Cdk-cyclin-Cks association with the Apc3 loop would
allow for a kinetically more efficient intra-molecular phosphorylation
of the Apc1 auto-inhibitory segment that only becomes accessible to
Cdk when transiently displaced from the C-box site (Figs 3c and 4a).

To determine whether Apc3 loop-mediated interactions with the
Cks2 subunit facilitated Apc1 300s loop phosphorylation, we analysed
phosphorylation of the Apc1 300s loop in conditions where such
interactions are disrupted. Either deletion of the Apc3 loop or omission
of the Cks2 subunit from the phosphorylation reaction, conditions that
reduce APC/C activation (Fig. 3a and Extended Data Fig. 1b), resulted
in the same reduction of Apc1 300s loop phosphorylation (Extended
Data Table 2). Specifically, mitotic phospho-sites associated with relief
of auto-inhibition (Ser364, Ser372 and Ser373) (Extended Data Table 2,
columns 3 and 8) are not modified when Cdk targeting to Apc3 is
disrupted (Extended Data Table 2, columns 4 and 5). In agreement
with’?** Ser362 and Ser364 are phosphorylated by Plk1 and Cdk2-
cyclin A3-Cks2, respectively, indicating that Ser364 phosphorylation
confers partial activation, whereas phosphorylation of Ser372 and
Ser373 requires the presence of both kinases, possibly owing to
inter-dependent priming reactions. Since the stimulatory phospho-Ser364
of the auto-inhibitory segment (Fig. 3c) is a non-Cdk-consensus site,
the relaxed specificity of Cdk2-cyclin A3-Cks2 phosphorylation of
this site may be conferred through targeting of the kinase to the APC/C
through the Apc3 loop.

Figure 2 | Apo unphosphorylated APC/C is repressed by an Apc1
auto-inhibitory segment. a, b, Two views of the phosphorylated apo
APC/C structure in cartoon within the 3.4 Å EM map (grey). The catalytic
module (Apc2 and Apc11) is in the inactive conformation. c, d, EM map of
unphosphorylated apo APC/C. Apc1 has two highly phosphorylated loops
within its WD40 domain (green). Whereas the 300s loop (residues 307-395)
is pointing towards the Apc1 auto-inhibitory segment density
(dark green) at the C-box binding site (black box), the 500s loop is
facing in the opposite direction. The hyperphosphorylated Apc3 loop
(residues 180-447) is located at the back of the APC/C and functions
as a Cdk recruitment site. The views in b and c are similar to Fig. 1b.
e-g, Close-up views of the C-box binding site in the EM maps of apo
unphosphorylated and phosphorylated APC/C and an APC/C^ΔApc1-300s
mutant with the Apc1 300s loop deleted. An elongated loop density
(dark green) for the Apc1 auto-inhibitory (AI) segment was observed
in the apo unphosphorylated state (e), but the density is absent in apo
phosphorylated APC/C (f). Deletion of the Apc1 300s loop shows a similar
loss of C-box site-associated density (g).

Cdc20 and Cdh1 bind to common sites on the APC/C yet only
Cdc20 association is regulated by APC/C phosphorylation. In
agreement with ref. 9, we find that Cdh1 activates unphosphorylated
and phosphorylated APC/C to a similar extent (Extended Data
Fig. 6d, e). Comparing unphosphorylated APC/C and APC/C^ΔApc1-300s,
we detect some stimulation with APC/C^ΔApc1-300s at low Cdh1
concentrations (Extended Data Fig. 6e). This is consistent with
the enhancement of Cdh1 binding to phosphorylated APC/C and
APC/C^ΔApc1-300s (Extended Data Fig. 6b). Testing the
phosphorylation-dependent activity of a set of Cdc20-Cdh1 chimaeras
indicated that the more extensive interactions involving Cdh1 and the
APC/C, mediated by both Cdh1^NTD and Cdh1^IR (Extended Data
Fig. 6d), with a contribution from the Cdh1^NTD α3 helix (Fig. 1c
and Extended Data Fig. 6f), account for the capacity of Cdh1 to
activate unphosphorylated APC/C. Although Cdh1 association
would also require displacement of the Apc1 auto-inhibitory segment,
a potential mechanism to explain the dependency of Cdc20
on APC/C phosphorylation is that the higher affinity of Cdh1 for the
APC/C (Extended Data Fig. 6e) is sufficient to compete for the
auto-inhibitory segment at the C-box binding site.

Figure 3 | The Apc1 auto-inhibitory segment binds to the C-box binding
site and mimics the Cdc20^C box. a, A native substrate Cdk2-cyclin
A2-Cks2 was used for ubiquitination assays. In vitro phosphorylated
APC/C (both Cdk2-cyclin A3-Cks2 and Plk1) can be activated by Cdc20
(lanes 1-5). Deletion of the Apc1 300s loop activated the APC/C without
phosphorylation (lanes 6, 7), and kinase treatment of APC/C^ΔApc1-300s did
not enhance APC/C activity. The APC/C^ΔApc3-loop mutant showed similar
activity as unphosphorylated APC/C (lanes 10, 11 versus 2, 3 and 4, 5),
but had reduced activation by phosphorylation. Nevertheless, deletion of
both Apc1 300s and Apc3 loops (APC/C^ΔApc1-300s;ΔApc3-loop) restored
activity to that of wild-type (WT) phosphorylated APC/C and
unphosphorylated APC/C^ΔApc1-300s (lanes 14-17). b, Identification of the
Apc1 auto-inhibitory segment occupying the C-box binding site by
assessing the inhibitory effect of eight peptides spanning the Apc1 300s
loop. A single peptide (peptide 7, residues 361-380) suppressed the
activity of APC/C^ΔApc1-300s (lane 9), indicating that this peptide
blocks the C-box binding site. A control with wild-type unphosphorylated
APC/C (unpAPC/C^WT) is in lane 11. c, The Apc1 auto-inhibitory (AI)
segment (peptide 7, residues 361-380) shares sequence similarity with
Cdc20^C box. A model for the auto-inhibitory segment (green) was fitted
into the EM density of the apo unphosphorylated APC/C map (grey). Arg368
overlaps with the crucial Arg78 of Cdc20^C box (purple, right panel). The
flanking serines shown to be phosphorylated are highlighted as red
spheres. Ser377 is outside the observed EM density. d, Mutation of a
single Arg368 residue (APC/C^Apc1-R368E) or mutating its four
neighbouring serine residues (Ser364, Ser372, Ser373, Ser377) to
glutamates (APC/C^Apc1-4S/E) activated the APC/C without
phosphorylation. 30 nM Cdc20 was used for the assay in a and 20 nM Cdc20
for the assays in b and d. Experiments in a and d were replicated three
times and in b five times. See Supplementary Fig. 1 for gel source data.

Figure 4 | Mechanism for APC/C activation by mitotic
phosphorylation and the molecular basis for TAME inhibition.
a, Cartoon showing the mechanism of APC/C activation by Apc1 and
Apc3 phosphorylation-induced relief of auto-inhibition. Artificial relief
of auto-inhibition, either by deletion of the Apc1 auto-inhibitory (AI)
segment or by its phospho-mimicking mutants, obviates the need to
phosphorylate Apc3. b, TAME has only a small inhibitory effect on
APC/C^Cdh1 (lanes 1-4), whereas it markedly reduced APC/C^Cdc20 activity
(lanes 5-8). The activity assay was performed with phosphorylated
wild-type (WT) APC/C and substrate Hsl1 at a coactivator concentration
of 10 nM. This experiment was replicated three times. See Supplementary
Fig. 1 for gel source data. c, d, EM reconstruction of APC/C^ΔApc1-300s in
complex with TAME showed densities (dark blue) for TAME (C-atoms
in lime green) at both the IR tail-binding site (c), and the C-box binding
site (d). Their positions overlap well with the crucial Arg78 and Tyr79 of
Cdc20^C box and Ile498 and Arg499 of Cdc20^IR, thereby inhibiting Cdc20
association (Extended Data Fig. 7).

Our findings suggest the interesting possibility of exploiting
differences in the affinities of the two coactivators for the design of
inhibitors that specifically target mitotic APC/C^Cdc20 and thus suppress
cell proliferation. A small molecule, tosyl-L-arginine methyl ester
(TAME), was reported to inhibit APC/C activation by both Cdc20 and Cdh1
through a proposed mechanism involving competition for the IR
tail-binding site of coactivator?’. However, we found that TAME is a more
potent inhibitor of APC/C^Cdc20 than APC/C^Cdh1 (Fig. 4b). To understand
the molecular basis underlying this inhibition, we determined the
structure of APC/C^ΔApc1-300s in complex with TAME (Extended Data
Table 1). We observed TAME density not only at the IR tail-binding site,
but also at the C-box binding site (Fig. 4c, d). This is consistent with
the structural similarities of the IR tail and C-box binding sites of
Apc3A and Apc8B, respectively”, although there are critical differences.
In Apc3A, TAME is reminiscent of the Ile-Arg motif of the coactivator IR
tail, whereas in Apc8B, TAME resembles the Arg-Tyr/Phe of the coactivator
C box (Extended Data Fig. 7). The mechanism of TAME inhibition of
APC/C^Cdc20 through a tosyl-Arg motif to block the Cdc20^C box binding
site is reminiscent of the Apc1 auto-inhibitory segment that is displaced
by APC/C phosphorylation. Both utilize a common structural motif
(Arg-aromatic) that mimics Arg78 and Tyr79 of Cdc20^C box to exploit
the lower (relative to Cdh1) affinity of Cdc20 for the APC/C.

How phosphorylation regulates mitotic APC/C activation by 
Cdc20 has been a long-standing puzzle. Our in vitro studies show 
that of almost 150 phosphorylation sites in mitotic APC/C, only a 
few in Apcl directly regulate Cdc20 binding through displacement 
of the auto-inhibitory segment. This work has relevance to under- 
standing the control of other large multimeric complexes by multi-site 
phosphorylation. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper.

METHODS


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Expression and purification of recombinant human APC/C. The genes for 
recombinant human APC/C were cloned into a modified MultiBac system, 
expressed and purified as described*®. The C terminus of Apc4 was fused to a 
TEV (tobacco etch virus)-cleavable StrepII x2 tag. 

Protein kinase purification. All four proteins (Cdk2, cyclin A3 (residues 
174-432), Cks2 and Plk1 (residues 37-338)) for the kinases were expressed indi- 
vidually in BL21 (DE3) Star cells at 18°C overnight. Pellets containing GST- 
tagged Cdk2, His-tagged cyclin A3 and His-SUMO-tagged Cks2 were combined 
and resuspended in the CDK lysis buffer (50 mM Tris/HCl pH 7.4, 180 mM 
NaCl, 5% glycerol and 2 mM DTT) supplemented with 0.1 mM PMSF, lysozyme,
5 units per ml benzonase and Complete EDTA-free protease inhibitors (Roche). 
After sonication, the cells were centrifuged at 20,000 r.p.m. for 1h at 4°C and the 
supernatant was incubated with the Glutathione Sepharose 4B (GE Healthcare) 
for 3h at 4°C. The resins were washed with the CDK lysis buffer and the GST- 
tag of Cdk2 was cleaved off with 3C PreScission protease overnight at 4°C. The 
flow-through from the resins was collected and TEV-cleaved overnight at 4°C. 
Finally, the protein complex was purified by a Superdex200 16/60 column (GE 
Healthcare). 

The Polo kinase Plk1 with an N-terminal His-MBP tag was purified with a 
HisTrap HP column (GE Healthcare) in the PLK lysis buffer (50 mM Tris/HCl 
pH 7.5, 300 mM NaCl, 20 mM imidazole, 5% glycerol and 2 mM β-mercaptoethanol).
The column was washed intensively with high-salt buffer (50 mM Tris/HCl
pH 7.5, 1 M NaCl, 20 mM imidazole, 5% glycerol and 2 mM β-mercaptoethanol).
Proteins were eluted with a gradient of the elution buffer (50 mM Tris/HCl pH 7.5,
300 mM NaCl, 300 mM imidazole, 5% glycerol and 2 mM β-mercaptoethanol)
followed by TEV-cleavage overnight at 4°C. The sample was re-applied onto the 
HisTrap HP column to remove the His tag, uncleaved proteins and nickel con- 
taminations. The collected flow-through was concentrated and loaded onto a 
Superdex75 16/60 column (GE Healthcare) for final purification. 

In vitro phosphorylation of recombinant human APC/C. Concentrated APC/C 
after Resource Q (GE Healthcare) was treated with Cdk2-cyclin A3-Cks2 and Plk1 
in a molar ratio of 1:1.5 (APC/C: kinases) in a reaction buffer of 40 mM HEPES 
pH 8.0, 10 mM MgCl2 and 0.6 mM DTT with 5 mM ATP and 50 mM NaF. The
reaction mixture was incubated at 30°C for 30 min before the final purification 
step by a Superose 6 3.2/300 column (GE Healthcare) in the APC/C gel-filtration 
buffer (20mM HEPES pH 8.0, 150 mM NaCl, 0.5mM TCEP). 

Expression and purification of the substrate Cdk2-cyclin A2-Cks2 and 
Cdc20. Full-length human cyclin A2 was cloned into the pETM41 vector with 
an N-terminal His-MBP tag. The protein was expressed in BL21 (DE3) Star cells 
at 18°C overnight. Pellets containing Cdk2, cyclin A2 and Cks2 were co-lysed for 
purification following a similar protocol as for the kinase purification. 

Full-length human Cdc20 was cloned into a modified pFastBac HTa vector with
an N-terminal His-MBP tag. The generated virus was amplified and expressed in
Sf9 cells. Harvested cell pellets were resuspended in Cdc20 lysis buffer (50 mM
HEPES pH 7.8, 500 mM NaCl, 30 mM imidazole, 10% glycerol and 0.5 mM TCEP)
supplemented with 0.1 mM PMSF, 5 units per ml benzonase and Complete
EDTA-free protease inhibitors and loaded onto a HisTrap HP column (GE Healthcare).
Proteins were eluted with a gradient to 300 mM imidazole. Collected peak
fractions were TEV-cleaved overnight in the dialysis bag (cut-off 6-8 kDa) against the
dialysis buffer (50 mM HEPES, pH 7.8, 300 mM NaCl, 5% glycerol and 0.5 mM
TCEP) at 4°C. The protein was re-applied onto the HisTrap HP column and the
flow-through was collected.

Complex formation of APC/C^Cdc20-Hsl1 and APC/C^ΔApc1-300s-TAME. In vitro
phosphorylated APC/C was treated with 40 µM CDK1/2 inhibitor III (ENZO Life
Sciences) before incubating with purified Cdc20 and Hsl1 (with a molar ratio of
1:1.5:2) on ice. The complex was purified by a Superose 6 3.2/300 column using
the Microakta system.

TAME (Sigma-Aldrich) was dissolved in 50% DMSO and 50% APC/C
gel-filtration buffer at a concentration of 1 M. 4 mM TAME was added to purified
APC/C^ΔApc1-300s and incubated on ice for 1 h before freezing cryo-grids.

Ubiquitination assays. The ubiquitination assay was performed with 60 nM
recombinant human APC/C, 90 nM UBA1, 300 nM UbcH10, 300 nM Ube2S,
70 µM ubiquitin, 2 µM substrate Cdk2-cyclin A2-Cks2 or Hsl1, 5 mM ATP,
0.25 mg ml^-1 BSA, 15 µM CDK1/2 inhibitor III and different concentrations of
purified human Cdc20 (5-30 nM) or Cdh1 (5-30 nM) in a 10 µl reaction volume
with 40 mM HEPES pH 8.0, 10 mM MgCl2 and 0.6 mM DTT (figure legends
indicate the exact coactivator concentration used in each assay). Reaction mixtures
were incubated at room temperature for various time points and terminated by
adding SDS-PAGE loading dye. Reactions were analysed by 4-12% NuPAGE
Bis-Tris gels followed by western blotting with an antibody against the His-tag of
ubiquitin (Clontech cat. code: 631212).

For the chimaeric Cdh1-Cdc20 assay, the following domain boundaries were
used to generate chimaeric Cdh1-Cdc20 proteins: Cdh1^NTD (residues 1-168),
Cdh1^WD40 (residues 169-475), Cdh1^IR (residues 476-496), Cdc20^NTD (residues
1-165), Cdc20^WD40 (residues 166-475), Cdc20^IR (residues 476-499). 10 nM of the
chimaeric proteins were used in the assay.
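
As a bench-side aid, the final concentrations listed for the ubiquitination assay can be converted into absolute amounts per reaction. The sketch below is illustrative only: it takes the reaction volume as 10 µl and uses the enzyme concentrations quoted above; the function and variable names are hypothetical and not part of the published protocol.

```python
# Illustrative conversion of final assay concentrations to amounts per reaction.
# Assumption: a 10 ul reaction volume; concentrations are those quoted in the
# ubiquitination assay description. All names here are hypothetical.

REACTION_VOLUME_UL = 10.0

FINAL_CONC_NM = {
    "APC/C": 60.0,
    "UBA1": 90.0,
    "UbcH10": 300.0,
    "Ube2S": 300.0,
}


def amount_fmol(conc_nm: float, volume_ul: float = REACTION_VOLUME_UL) -> float:
    """Return the amount in femtomoles.

    nM x ul = fmol, because 1e-9 mol/l x 1e-6 l = 1e-15 mol.
    """
    return conc_nm * volume_ul


def coactivator_to_apc_ratio(coactivator_nm: float,
                             apc_nm: float = FINAL_CONC_NM["APC/C"]) -> float:
    """Molar ratio of coactivator (Cdc20 or Cdh1) to APC/C in the reaction."""
    return coactivator_nm / apc_nm


amounts = {name: amount_fmol(conc) for name, conc in FINAL_CONC_NM.items()}
for name, fmol in amounts.items():
    print(f"{name}: {fmol:.0f} fmol per reaction")

# Coactivator is titrated from 5 to 30 nM, so it is always sub-stoichiometric
# relative to 60 nM APC/C.
for coactivator_nm in (5.0, 10.0, 20.0, 30.0):
    ratio = coactivator_to_apc_ratio(coactivator_nm)
    print(f"{coactivator_nm:.0f} nM coactivator -> {ratio:.2f} per APC/C")
```

The ratio calculation makes explicit that, across the stated 5-30 nM range, the coactivator never exceeds half the molar amount of APC/C.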

Peptide and TAME assays. Eight peptides (Designer BioScience) spanning the
Apc1 300s loop were synthesized for identification of the Apc1 auto-inhibitory
segment. Each peptide contains 20 amino acids, with a ten-residue overlap with
the neighbouring peptides: peptide 1 (LTAHLRSLSKGDSPVTSPFQ); peptide 2
(GDSPVTSPFQNYSSIHSQSR); peptide 3 (NYSSIHSQSRSTSSPSLHSR); peptide 4
(STSSPSLHSRSPSISNMAAL); peptide 5 (SPSISNMAALSRAHSPALGV); peptide
6 (SRAHSPALGVHSFSGVQRFN); peptide 7 (HSFSGVQRFNISSHNQSPKR)
and peptide 8 (ISSHNQSPKRHSISHSPNSN). The following mutant peptides
of peptide 7 were used to assess the relief of auto-inhibition: peptide R368E
(HSFSGVQEFNISSHNQSPKR), peptide 4S/E (HSFEGVQRFNIEEHNQEPKR)
and peptide pS377 (HSFSGVQRFNISSHNQphosphoSPKR). The peptides
were dissolved at a concentration of 10 mM in 100% dimethylsulfoxide (DMSO)
and diluted using the APC/C gel-filtration buffer. The ubiquitination assay
was performed using a final concentration of 200 µM peptide. The TAME
assay was performed under similar conditions using 0.5-3 mM TAME and the
substrate Hsl1.
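
The peptide design described above (20-residue windows advancing by ten residues) can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the loop segment is reassembled from the listed peptide sequences, and the starting residue number of 301 is an inference from peptide 7 being stated to span residues 361-380. The helper names `tile` and `mutate` are hypothetical.

```python
# Sketch of the overlapping-peptide design: 20-mers with a 10-residue overlap.
# Assumptions: the segment below is reassembled from the peptide sequences in
# the text; FIRST_RESIDUE = 301 is inferred from peptide 7 = residues 361-380.

WINDOW = 20
STEP = 10
FIRST_RESIDUE = 301

LOOP_SEGMENT = (
    "LTAHLRSLSK" "GDSPVTSPFQ" "NYSSIHSQSR" "STSSPSLHSR" "SPSISNMAAL"
    "SRAHSPALGV" "HSFSGVQRFN" "ISSHNQSPKR" "HSISHSPNSN"
)


def tile(seq: str, window: int = WINDOW, step: int = STEP) -> list[str]:
    """Cut seq into overlapping windows of fixed length."""
    return [seq[i:i + window] for i in range(0, len(seq) - window + 1, step)]


def residue_span(index: int) -> tuple[int, int]:
    """First and last residue numbers of the peptide at a 0-based index."""
    start = FIRST_RESIDUE + index * STEP
    return start, start + WINDOW - 1


def mutate(peptide: str, first_residue: int,
           substitutions: dict[int, tuple[str, str]]) -> str:
    """Apply point substitutions given as {residue_number: (old, new)}."""
    residues = list(peptide)
    for position, (old, new) in substitutions.items():
        idx = position - first_residue
        if residues[idx] != old:
            raise ValueError(f"expected {old} at {position}, found {residues[idx]}")
        residues[idx] = new
    return "".join(residues)


peptides = tile(LOOP_SEGMENT)
peptide7 = peptides[6]
p7_start, _ = residue_span(6)

# Mutant peptides used to probe relief of auto-inhibition.
r368e = mutate(peptide7, p7_start, {368: ("R", "E")})
four_s_to_e = mutate(
    peptide7, p7_start,
    {364: ("S", "E"), 372: ("S", "E"), 373: ("S", "E"), 377: ("S", "E")},
)
```

Because `mutate` checks the wild-type residue before substituting, a numbering error (for example, an off-by-one in the assumed start residue) raises immediately instead of silently producing the wrong mutant.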

Size-exclusion chromatography to assess coactivator binding. Purified APC/C 
samples (1 mg ml^-1) were incubated with either Cdh1 or Cdc20 at a molar ratio
of 1:1.5 on ice for 30 min. The sample was spun down at 13,000 r.p.m. for 5 min to 
remove any precipitates or aggregates before injecting onto a Superose 6 3.2/300 
column using the Microakta system. The eluted peak fractions were analysed by 
SDS-PAGE and western blotting. 

Mass spectrometry. Purified proteins were prepared for mass spectrometric 
analysis by in solution enzymatic digestion, without prior reduction and alkyla- 
tion. Protein samples were digested with trypsin or elastase (Promega), both at an 
enzyme to protein ratio of 1:20. The resulting peptides were analysed by nano-scale 
capillary LC-MS/MS using an Ultimate U3000 HPLC (ThermoScientific Dionex) 
to deliver a flow of approximately 300 nl min^-1. A C18 Acclaim PepMap100
5 µm, 100 µm × 20 mm nanoViper (ThermoScientific Dionex), trapped the
peptides before separation on a C18 Acclaim PepMap100 3 µm, 75 µm × 250 mm
nanoViper (ThermoScientific Dionex, San Jose, USA). Peptides were eluted 
with a 90 min gradient of acetonitrile (2% to 50%). The analytical column out- 
let was directly interfaced via a nano-flow electrospray ionization source, with 
a hybrid quadrupole orbitrap mass spectrometer (Q-Exactive Plus Orbitrap, 
ThermoScientific). LC-MS/MS data were then searched against an in-house LMB 
database using the Mascot search engine (Matrix Science)*”, and the peptide iden- 
tifications validated using the Scaffold program (Proteome Software Inc.)**. All 
data were additionally interrogated manually. 

Electron microscopy. Freshly purified APC/C samples were first visualized by 
negative-staining EM to check the sample quality and homogeneity and to get 
initial low-resolution reconstructions. Micrographs were recorded on an FEI Spirit 
electron microscope at an accelerating voltage of 120 kV and at a defocus of
approximately -1.5 µm. For cryo-EM, 2 µl aliquots of the sample at ~0.15 mg ml^-1 were
applied onto the Quantifoil R2/2 grids coated with a layer of continuous carbon
film (approximately 50 Å thick). Grids were treated with a 9:1 argon:oxygen plasma
cleaner for 20 to 40 s before use. The grids were incubated for 30 s at 4°C and
100% humidity before blotting for 5 s and plunging into liquid ethane using an FEI
Vitrobot III. The grids were loaded into an FEI Tecnai Polara electron microscope
at an acceleration voltage of 300 kV. Micrographs were taken using EPU software
(FEI) at a nominal magnification of 78,000 which yields a pixel size of 1.36 Å per
pixel. They were recorded by an FEI Falcon III direct electron detector with a
defocus range of -2.0 to -4.0 µm. The exposure time for each micrograph was 2 s
at a dose rate of 27 electrons per Å^2 per s. 34 movie frames were recorded for each
micrograph as described*?. 
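
The acquisition parameters above imply a few derived quantities that are often quoted alongside them. The following sketch simply does that arithmetic from the stated exposure time, dose rate, frame count and pixel size, together with the 264-pixel particle box given under image processing; the derived figures are calculations for orientation, not values reported by the authors, and the variable names are hypothetical.

```python
# Derived cryo-EM acquisition quantities from the stated parameters.
# These are back-of-envelope calculations, not values reported in the text.

PIXEL_SIZE_A = 1.36          # Angstrom per pixel
EXPOSURE_S = 2.0             # seconds per micrograph
DOSE_RATE_E_PER_A2_S = 27.0  # electrons per square Angstrom per second
N_FRAMES = 34                # movie frames per micrograph
BOX_PIXELS = 264             # particle box edge, from the image-processing step

# Total accumulated dose per micrograph (electrons per square Angstrom).
total_dose = DOSE_RATE_E_PER_A2_S * EXPOSURE_S

# Average dose carried by each movie frame.
dose_per_frame = total_dose / N_FRAMES

# Physical edge length of the particle box.
box_angstrom = BOX_PIXELS * PIXEL_SIZE_A

# Nyquist limit: the finest resolvable spacing is two pixels.
nyquist_a = 2.0 * PIXEL_SIZE_A

print(f"total dose: {total_dose:.1f} e/A^2")
print(f"dose per frame: {dose_per_frame:.2f} e/A^2")
print(f"box edge: {box_angstrom:.2f} A")
print(f"Nyquist limit: {nyquist_a:.2f} A")
```

The Nyquist limit sits comfortably below the map resolutions quoted in this work, so pixel sampling is not the limiting factor for those reconstructions.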

Image processing. All movie frames were aligned by the motioncorr program™ 
before subsequent processing. First, the contrast transfer function parameters 
were calculated with CTFFIND3 or Gctf**°°. Particles in 264 pixels x 264 pixels
were selected by automatic particle picking in RELION 1.4 (ref. 37). The following 
steps were performed to exclude bad particles from the data set: (1) automatically 
picked particles in each micrograph were screened manually to remove ice con- 
taminations**; (2) after particle sorting in RELION, particles with poor similarity 
to reference images were deleted; (3) 2-dimensional classification was performed 
and particles in bad classes with poorly recognizable features were excluded. 
The remaining particles were divided into six classes using three-dimensional
classification in RELION. During this process particles with conformational
heterogeneity and leftover bad particles were removed from the final
reconstruction. After 3D refinement, beam-induced particle motion was corrected using
particle polishing in RELION**”®. All resolution estimations were based on the
gold-standard Fourier shell correlation (FSC) calculations using the FSC = 0.143
criterion. The model for the apo state (Fig. 2a, b, combined apo APC/C structure)
was built based on a 3.4 Å resolution map, reconstructed by combined data collected
from pure apo phosphorylated APC/C and the apo state particles classified from
the APC/C^Cdc20-Hsl1 complex (Extended Data Fig. 2 and Extended Data Table 1).
This map allowed for model building of regions which were less well-resolved and
presented as poly-alanines in the previous APC/C^Cdh1-Emi1 structure”, including
Apc2^NTD, Apc1 loops in both the WD40 domain and the middle domain and
regions in Apc5^NTD. A summary of all EM reconstructions obtained in this work
is listed in Extended Data Table 1a.
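
To make the FSC = 0.143 criterion concrete, the sketch below reads a resolution off an FSC curve by finding the first point where the curve drops below the threshold and interpolating linearly between the two neighbouring shells. It is a generic illustration, not the RELION implementation; the function name is hypothetical and the example curve consists of made-up toy values, not data from this study.

```python
# Generic illustration of reading a resolution from an FSC curve at a threshold.
# Not the RELION implementation; the toy curve below is invented for the example.

from typing import Optional, Sequence


def resolution_at_fsc_threshold(spatial_freqs: Sequence[float],
                                fsc_values: Sequence[float],
                                threshold: float = 0.143) -> Optional[float]:
    """Return the resolution (in Angstrom) where the FSC first falls below threshold.

    spatial_freqs are in 1/Angstrom and must be increasing. The crossing point
    is found by linear interpolation between adjacent shells. Returns None if
    the curve never drops below the threshold.
    """
    if len(spatial_freqs) != len(fsc_values):
        raise ValueError("frequency and FSC arrays must have equal length")
    for i in range(1, len(fsc_values)):
        above, below = fsc_values[i - 1], fsc_values[i]
        if above >= threshold > below:
            fraction = (above - threshold) / (above - below)
            freq = spatial_freqs[i - 1] + fraction * (
                spatial_freqs[i] - spatial_freqs[i - 1]
            )
            return 1.0 / freq
    return None


# Toy curve (hypothetical values) sampled at four spatial frequencies.
toy_freqs = [0.1, 0.2, 0.3, 0.4]
toy_fsc = [1.0, 0.8, 0.2, 0.05]
toy_resolution = resolution_at_fsc_threshold(toy_freqs, toy_fsc)
print(f"toy resolution at FSC = 0.143: {toy_resolution:.2f} A")
```

Returning `None` when the curve never crosses the threshold avoids reporting a resolution beyond the sampled frequency range.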

Model building. Model building of both apo APC/C and APC/C^Cdc20-Hsl1
structures was performed in COOT”. Initially, available atomic structure of human
APC/C^Cdh1-Emi1 (PDB 4UI9)23 and the crystal structure of Cdc20^WD40 (PDB
4GGC)"" were rigid-body fitted in individual subunits into the cryo-EM maps
in Chimera®. All fitted structures were rebuilt according to the cryo-EM map.
Cdc20^NTD, Cdc20^CTD and several loop regions not seen in previous structures were
built ab initio. The models were refined by REFMAC 5.8 (ref. 43). A REFMAC
weight of 0.04 was defined by cross-validation using half reconstructions“. A
resolution limit of 4.0 Å or 3.5 Å was used for the APC/C^Cdc20-Hsl1 and the apo APC/C
structure, respectively. All available crystal structures or NMR structures were
used for secondary structure restraints. The refinement statistics are summarized
in Extended Data Table 1b.

Map visualization. Figures were generated using Pymol and Chimera™. 
Sequence alignment. Sequence alignment was performed using Jalview”’.

Extended Data Figure 1 | Preparations and EM images of different
APC/C samples used for structural studies. a, Recombinant human
APC/C was phosphorylated in vitro using Cdk2-cyclin A3, Cdk2-cyclin
A3-Cks2 or Plk1 alone or with both Cdk2-cyclin A3-Cks2 and Plk1.
The phosphorylated APC/C samples are shown after SDS-PAGE. b, In vitro
phosphorylated recombinant human APC/C can be fully activated by
Cdc20 to ubiquitylate a native substrate Cdk2-cyclin A2-Cks2 when both
kinases were added (lanes 9, 10). Without Cks2 (lanes 3, 4) or with Plk1
alone (lanes 7, 8) no activation of the APC/C could be observed, whereas
treating with Cdk2-cyclin A3-Cks2 alone (lanes 5, 6) resulted in its partial
activation. Samples were recorded at 15 and 30 min of the reaction and
20 nM of Cdc20 was used. This experiment was replicated three times.
Anti-Apc3 antibodies (BD Bioscience, cat. code: 610454) were used as a
loading control. c, Purified wild-type (WT) APC/C and mutant samples
with and without kinase treatment (both Cdk2-cyclin A3-Cks2 and
Plk1). Upon deletion of the Apc3 loop, no association of the Cdk2-cyclin
A3-Cks2 kinase to the APC/C could be observed (lanes 6 and 8). d, SDS-
PAGE of purified APC/C^Cdc20-Hsl1 ternary complex. e, A typical cryo-EM
micrograph of APC/C^Cdc20-Hsl1 representative of 15,582 micrographs.
f, Gallery of two-dimensional averages of APC/C^Cdc20-Hsl1 showing
different views; representative of 100 two-dimensional averages.
g, Gold-standard FSC curves of all APC/C reconstructions in this work.
See Supplementary Fig. 1 for gel source data.

Extended Data Figure 2 | Three-dimensional classification of
APC/C^Cdc20-Hsl1. The initial particles after two-dimensional
classification were divided into six classes by three-dimensional
classification using RELION. The resultant classes were grouped into four
categories: (i) 9.0% in the active ternary state with coactivator and
substrate bound; (ii) 11.3% in a hybrid state with coactivator bound, but
the APC/C in the inactive conformation; (iii) 71.6% in the inactive apo
state; (iv) 8.1% with poorer reconstruction owing to some bad particles.
The first class in the active ternary state containing 179,660 particles
was used for three-dimensional refinement and movie correction to obtain
the final reconstruction at 3.9 Å.
Cdce20° 


Apcé6A | 
Cdc20°T vs Cdh1°' Cdc20'R-Apc3A 


Extended Data Figure 3 | Comparison of Cdc20 and Cdh1 association much as 10 A. c, d, EM density for Cdc20°>® allowed for ab initio model 
to the APC/C. a, The catalytic module (Apc2-Apcl11) of the APC/CC4?°#s!___ building and the C-box interaction with Apc8B (cyan) is well conserved 


t 


complex is flexible and almost no density accounting for Apc11 (pink, between the two coactivators. e, Both Cdc20!8 (right) and Cdh1'® (left) 
modelled based on the structure of APC/C°h!=™!, PDB 4UI9)** could associates with Apc3A (orange), although the EM density for Cdc20" is 
be observed. b, The WD40 domain of Cdc20 (purple) occupies a similar much weaker (not shown) and the C-terminal a-helix in Cdh1" is absent. 


position as Cdh1““? (grey), but it is displaced from the APC/C by as 
Extended Data Figure 4 | Conformational changes of the APC/C between the
inactive apo and the active ternary states and domain and sequence analysis
of Cdc20. a, b, Subunits that undergo conformational changes upon coactivator
and substrate binding are highlighted in their ternary state and coloured as
in Fig. 1, while the corresponding proteins in the inactive apo state are in
lighter shades. In the active conformation, the platform subdomain containing
subunits Apc1, Apc4 and Apc5 is shifted upward, inducing a large movement of
the catalytic module to enable E2 access. c, Domain organization of Cdc20.
d, Sequence alignment of Cdc20^NTD and Cdh1^NTD with α-helices represented as
cylinders (purple and grey for Cdc20^NTD and Cdh1^NTD, respectively)
underneath the sequences and the C-box and KILR/KLLR motif highlighted.

Extended Data Figure 5 | Comparison of apo APC/C in unphosphorylated and
phosphorylated states. a, b, Superposition of the apo unphosphorylated
(magenta) and phosphorylated (cyan) APC/C EM maps revealed few conformational
differences except in the vicinity of the C-box binding site. c, Apc3A is in
an equilibrium between open (light blue) and closed (orange) conformations.
While in the inactive apo state the majority of Apc3A is in the closed state,
association of Cdc20^IR stabilizes the open state. d, Sequence alignment of
the Apc1 300s loop across different species: human, mouse, Xenopus tropicalis
(western clawed frog) and Danio rerio (zebrafish). Phosphorylation sites are
indicated and residues accounting for the Apc1 auto-inhibitory segment
(361-380) are boxed.
© 2016 Macmillan Publishers Limited. All rights reserved 


Extended Data Figure 6 | Analytical gel filtration and activity assays.
a, With equal amounts of input Cdc20, phosphorylated APC/C could form a stable
binary complex with Cdc20 after a gel-filtration purification step (lane 5),
whereas unphosphorylated APC/C could not (lane 4). b, Both unphosphorylated
and phosphorylated APC/C associate with Cdh1 stably on gel filtration, as does
APC/C^ΔApc1-300s. Anti-Cdc20 antibody (Santa Cruz Biotechnology, cat. code:
sc-8358) and anti-Cdh1 antibody (Sigma, cat. code: C7855) were used for
detection; antibody to Apc4 (ref. 6) served as a loading control and
unphosphorylated APC/C alone was used as a negative control for western
blotting. c, Point mutations of peptide 7 (residues 361-380), either mutating
Arg368 to glutamate or mutating the four neighbouring serines to
phospho-mimics (Glu), abolished the inhibitory effect of the peptide and
restored APC/C activity (lanes 4, 5). Phosphorylation of Ser377 alone resulted
in only partial activation of the APC/C (lane 6). d, Chimaeric proteins
composed of the NTD, the WD40 domain and the IR tail of either Cdc20 or Cdh1
were purified to study their differences in APC/C activation. Both the NTD and
the CTD of the coactivators are essential for their association with the
APC/C. Swapping both the NTD and the CTD of Cdh1 with those of Cdc20 makes it
phosphorylation sensitive (lanes 7, 8), similar to Cdc20 (lanes 9, 10), and
vice versa. e, Top, Cdh1 can activate both unphosphorylated and phosphorylated
APC/C similarly, whereas Cdc20 requires APC/C phosphorylation for its
activity. Bottom, a titration of Cdh1 against unphosphorylated APC/C and
APC/C^ΔApc1-300s showed enhanced activity in the absence of the Apc1
auto-inhibitory segment at low Cdh1 concentration (<10 nM), whereas Cdc20
requires displacement of the auto-inhibitory segment for its activity.
f, Deletion of the Cdh1 α3 helix resulted in reduced activation of the APC/C
and makes Cdh1 more phosphorylation sensitive. The substrate
Cdk2-cyclin A2-Cks2 was used for the assay in c and Hsl1 for the assays in
d-f. 20 nM Cdc20 was used in c, 10 nM chimaeric coactivators in d and 30 nM
coactivators in f. Experiments in a and b were replicated two times, in c, e
and f three times and in d four times. See Supplementary Fig. 1 for gel source
data.
Extended Data Figure 7 | TAME competes with Cdc20 to bind at the IR-tail and
the C-box binding sites. a, TAME (C atoms in lime green) is superimposed with
Cdc20^IR (purple) and the arginine motif in both structures engages the same
binding site on Apc3A (orange). b, The tosyl-Arg motif of TAME overlaps with
Arg78-Tyr79 of Cdc20^C-box at the C-box binding site to out-compete Cdc20.
c, A density for TAME was also observed within a pocket of the Apc8A TPR
super-helix, similar to that of Apc8B.
Extended Data Table 1 | EM data collection, processing statistics and structure refinement statistics

a. Statistics of all cryo-EM reconstructions

Samples | Particles used for final reconstruction | Resolution (Å) | EM-DB accession code
APC/C^Cdc20-Hsl1 | 179,660 (9.0%) | 3.9 | 3385
apo unphosphorylated APC/C | 347,317 (49.5%) | 3.8 | 3386
apo phosphorylated APC/C (apo phosphorylated APC/C data only) | 83,642 (63.1%) | 4.3 | 3387
combined apo phosphorylated APC/C (both apo phosphorylated APC/C data and apo phosphorylated particles classified from APC/C^Cdc20-Hsl1) | 921,993 (70.8%) | 3.4 | 3388
apo APC/C^ΔApc1-300s mutant | 262,090 (80.2%) | 3.8 | 3389
ARGC iS eMs | 246,065 (78.2%) | 4.0 | 3390

b. Statistics of APC/C^Cdc20-Hsl1 and combined apo phosphorylated APC/C structure determination

 | APC/C^Cdc20-Hsl1 | combined apo phospho. APC/C
Data collection
EM | FEI Polara, 300 keV | FEI Polara, 300 keV
Detector | FEI Falcon III | FEI Falcon III
Pixel size (Å) | 1.36 | 1.36
Defocus range (μm) | 2.0-4.0 | 2.0-4.0
Reconstruction
Software | RELION 1.4 | RELION 1.4
Accuracy of rotations (degrees) | 1.268 | 0.884
Accuracy of translations (pixels) | 0.81 | 0.62
Final resolution (Å) | 3.9 | 3.4
Refinement
Software | RefMac 5.8 | RefMac 5.8
Refmac weight | 0.04 | 0.04
Resolution limit (Å) | 4.0 | 3.5
Residue number | 8164 | 7908
Average Fourier shell correlation | 0.7568 | 0.7778
R factor | 0.3621 | 0.3451
Rms bond length (Å) | 0.0121 | 0.0141
Rms bond angle (°) | 1.7812 | 1.7435
Validation (Ramachandran plot)
Preferred | 7502 (91.89%) | 7427 (93.92%)
Allowed | 400 (4.90%) | 287 (3.63%)
Outliers | 262 (3.21%) | 194 (2.45%)
RCSB PDB accession code | 5G04 | 5G05
Extended Data Table 2 | Phosphorylation sites of Apc1 and Apc3 subunits of in vitro phosphorylated APC/C identified by mass spectrometry


Phospho-sites |APc/c™T apc/c™t Aapc/c**cse0r | apcycWT apc/c™" Comparison with 
(Cdk2-cyclinA3-|Cdk2-cyclinA3- Cak2-cyclinA3-|puplished studies Ret. 
\Cks2_+ PlkT |Cks2 |Cks2_+ Pik1 
LVGS6OLQE 22 LPNSTS3CTT 
QEVTSSIHE 22(interphase) NSCT185TQV 
PPGS202PRE 12,21,22 ISCTT186QVP_ 
LFGS233SRV_[N.D. LD. 21 PNHS192LSH 
LKFS286EQG 12,22 IHSLS194HRQ_ 
QGGT291PQN 12,21,22 QPET200VLT 
NVAT297SSS TVLT203ETP 
VATS298SSL LTET205PQD 12,21,22 
IATSS299SLT PQDT20SIEL 12 
RSLS3O9KGD .D. 22(interphase) INLES219SNS .D. 22(interphase) 
KGDS313PVT 22 LESS220NSK _|N.D. 21,22 
SPVT316SPF 22 SSNS222KYS 22 
PVTS317PFQ 22 SKYS225LNT 
HSRS347 PSI 72,21,22 SLNT228DSS 
RSPS3431SN 12,22 INTDS280SVS 
AALS351 RAH 22 TSDS231VSY 
RAHS355PAL 12,21,22 SSVS233VID 
|GVHS362FSG 12,21,22 YIDS237 AVI 
HSFS3646VQ 12,22 AVIS241 PDT 
FNIS372SHN 22 [SPDT 244VPL 
NISS373HNQ. 12,22 GTGT251 SIL 
HNQS377 PKR 12,21,22 JTGTS252SIL 
ISHS386PNS 3 22(interphase) SILS255KQV. 
SPNS389NSN LD. 22 KPKT264GRS 
SNGS394FLA |TGRS267LLG 
WTET41 6ITN [AALS276PLT 
}VLYTSOIGVV [SPLT279PSF 
PAPSS18LTM 3 1D. 22 LTPS281 FGI 
PSLTSZOMSN_[N.D. LD. 72(interphase),22 PLET289PSP 
LTMS522NTM .D. ID. 22(interphase) |ETPS291 PGD 
MSNT5S24MPR 22(interphase) IQNYT302NTP 
RPSTS30PLD 12(interphase),21,22 YTNTSO4PPV 
DGVSS36TPK IDVPS312TGA 
[GVSTS37PKP 12,22 VPST313GAP 
KPLS542KLL LD. 21 IGQT327GTK 
LLGSS47LDE 12,22 QTGT329KSV 
VLLSSS5PVP 12,21,22 GTKS331 VFS 
LRDSS63SKL 22 ISVFS334QSG 
RDSSS64KLH_[N.D. 21,22 FSQS336GNS 
LHDSS69LYN 12(interphase),22 REVT343PIL 
EDCTS76FQQ LAQT349Q5S 
QLGTS82YIH QTQS251 SGP 
LELS600NGS 1D. 21 ITQSS352GPQ 
FEGSGBELSP 22 GPQT356STT 
GSLS688PVI Tz 21,22 PQTS357TTP. 
ARPSOSSETG 22 QTSTSseTrQ 
PSET701GSD 22 TSTT359PQV 
ETGS703DDD 22 (interphase) QVLS364PTI 12 
UCLS731PSE 12,22(interphase),21 LSPT366ITS 12,21 
NRFS916FRH 22 PTITS68SPP 
FRHS920TSV TITS369PPN 12,21,22 
RHST921SVS RLFT383SDS 21 
HSTS922VSS LFTS384DSS 21 
VLSS1001DVP TsDs3e6sTT 21 
KHKS1347PSY SDSS387TTK 21 
KSPS1345¥Q) DSSTSSETKE 
[SSTT3S9KEN 21 
|GGIT419QPN 
INDS426LEI 
The pink shading shows the presence of phosphorylation sites and the white indicates their absence.
LEIT430KLD
The phosphorylation sites within the Apc1 AI segment are highlighted in light purple.
KLDS4345I
LOSS435iIS 
SIS43 BEGK 
(GKIS443T1T 
KIST4441TP_ 
STIT446PQ) 
MNFS761WAM 
IMGTSOODES 
TDESSO3QES. 
ISQESSO6SMT 
QESS807MTD 
[SSMTBO9DAD 
|ADDTS8 1 4QLH 
[AAES821 DEF 


wT pes oop WT WT 
Phospho-sites APC/C' japc/c™ |APC/C! |APC/C comparison with 


(Cak2-cyclinA3-|Cdk2-cyclinA3- ICak2-cyclinA3-|published studies Ret. 
|Cks2 + PIk1 |Cks2 |Cks2 + Pik 


Kinases used Cdk2-cyclinAs 


Kinases used |No treatment |Cdk2-cyclinA3 
Extended Data Table 3 | Summary of phosphorylation sites of in vitro phosphorylated APC/C subunits (excluding Apc1 and Apc3) identified by mass spectrometry
Fi ., WT 
Comparison with Protein_|Phospho-sites_|APC/C ‘comparison with 


Cc 
Kinases used No treatment |CdK2-cyclinA3-| published studies Ref. Kinases used |No treatment |C9K2-cyclinA3-| published studies Ref. 
Cks2 + Plk1 Cks2 + PIk1 


Jeuos2zosrvA fof] VRPS119 GN 
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ST 
The pink shading shows the presence of phosphorylation sites
and the white indicates their absence.
Activation of the A2A adenosine G-protein-coupled
receptor by conformational selection


Libin Ye, Ned Van Eps, Marco Zimmer, Oliver P. Ernst & R. Scott Prosser


Conformational selection and induced fit are two prevailing
mechanisms to explain the molecular basis for ligand-based
activation of receptors. G-protein-coupled receptors are the largest
class of cell surface receptors and are important drug targets.
A molecular understanding of their activation mechanism is critical
for drug discovery and design. However, direct evidence that
addresses how agonist binding leads to the formation of an active
receptor state is scarce. Here we use ¹⁹F nuclear magnetic resonance
to quantify the conformational landscape occupied by the adenosine
A2A receptor (A2AR), a prototypical class A G-protein-coupled
receptor. We find an ensemble of four states in equilibrium: (1) two
inactive states in millisecond exchange, consistent with a formed
(state S1) and a broken (state S2) salt bridge (known as ‘ionic lock’)
between transmembrane helices 3 and 6; and (2) two active states,
S3 and S3′, as identified by binding of a G-protein-derived peptide.
In contrast to a recent study of the β2-adrenergic receptor, the present
approach allowed identification of a second active state for A2AR.
Addition of inverse agonist (ZM241385) increases the population of
the inactive states, while full agonists (UK432097 or NECA) stabilize
the active state, S3′, in a manner consistent with conformational
selection. In contrast, partial agonist (LUF5834) and an allosteric
modulator (HMA) exclusively increase the population of the S3 state.
Thus, partial agonism is achieved here by conformational selection
of a distinct active state which we predict will have compromised
coupling to the G protein. Direct observation of the conformational
equilibria of ligand-dependent G-protein-coupled receptors and
deduction of the underlying mechanisms of receptor activation
will have wide-reaching implications for our understanding of the
function of G-protein-coupled receptors in health and disease.

A myriad of signalling processes associated with vision, sensory 
response, neurotransmitter- and hormone-mediated response, inflamma- 
tion, and cell homeostasis are governed by G-protein-coupled receptors 
(GPCRs), also called seven transmembrane helix (7TM) receptors. 
A2AR is a family A GPCR and an important drug target for treating
inflammation, cancer, ischaemia reperfusion injury, sickle cell disease,
diabetic nephropathy, infectious diseases, and neuronal disorders. An
understanding of the mechanism of GPCR activation and the repre- 
sentative conformational states is key to the drug design process. Our 
molecular perspective of activation is biased by X-ray crystallography, 
where the receptor is stabilized through thermostable mutants, fusion 
protein constructs, and appropriate ligands to obtain a single lowest- 
energy structure, often designated as either ‘inactive’ or ‘active’. Using 
¹⁹F NMR and judiciously placed tags, we observed A2AR in a dynamic
equilibrium between two inactive and two active states. The activation 
process can thus be viewed from the perspective of populations of key 
functional states, and the action of ligands on this conformational land- 
scape through conformational selection. 

X-ray crystal structures of A2AR, stabilized either by inverse agonist
or by agonist, suggest that receptor activation involves a rearrangement
of the 7TM bundle; that is, the inward shift of the intracellular part of
TM7, a translation of TM3, and the formation of a bulge in TM5, in
addition to an outward displacement and rotation of TM6 bringing
together the intracellular ends of TM5 and TM6 (refs 6-8). Analogous
observations were made for the β2-adrenergic receptor (β2AR) and the
light-activatable GPCR rhodopsin, suggesting a common activation
pathway (Extended Data Fig. 1). Via activation intermediates through
which these TM domains rearrange, GPCRs form an increasingly large
crevice at the cytoplasmic side, which is eventually large enough to
harbour the key binding sites of interacting G protein and arrestin.
We used electron paramagnetic resonance (EPR) and NMR to identify
labelling sites on TM5 and TM6. A ¹⁹F NMR label at V229C on
TM6 (Extended Data Figs 2 and 3) appeared to be ideal for monitoring
activation of A2AR (the version used in this study is truncated after
residue 317). In assessing conformational states and studying
conformational exchange of GPCRs on the microsecond to millisecond
timescale, both ¹³C and ¹⁹F NMR have proved useful (refs 14-18). In
particular, ¹⁹F NMR provides exquisite sensitivity to solvent exposure or
side-chain packing, often revealing a wealth of conformations.

A recent ¹⁹F NMR study of β2AR identified four distinct states
associated with receptor activation. The apo form of β2AR was populated
solely by two rapidly exchanging conformers corresponding to the
‘ionic lock’, a salt bridge between Arg131^3.50 on TM3 and Glu268^6.30
on TM6, either formed (S1) or broken (S2). An additional long-lived
(lifetime τ = 660 ms) β2AR active state (S3), in slow exchange with
S1 and S2, was identified upon binding of agonist. Further addition of a
nanobody mimicking a G protein established another, fully active state
(S4) of β2AR, deemed to be competent for signalling as concluded from
the same maximally splayed cytoplasmic surface as in the β2AR•Gαs
crystal structure. Because neither of the two active states, S3 and S4,
could be detected in the ligand-free apo form of β2AR, it was not
possible to distinguish between induced fit and conformational selection
as models for β2AR activation.

In contrast to β2AR, the present ¹⁹F NMR study revealed four
states (two inactive and two active) associated with ligand-free apo
A2AR (Fig. 1 and Extended Data Figs 4 and 5). Owing to striking
parallels with the previous study of β2AR, we have adopted a similar
nomenclature for the states. The two inactive states S1 and S2 are in fast
exchange on a millisecond timescale (Extended Data Fig. 4) and are
represented by a single resonance, designated S1-2, which in analogy
to β2AR flickers between an ionic lock stabilized (S1) and broken state
(S2). Corresponding states are seen in A2AR crystal structures: a
thermostabilized A2AR mutant with inverse agonist bound reveals an intact
ionic lock between Arg102^3.50 and Glu228^6.30 (ref. 19), whereas A2AR
structures with either antagonist or agonist bound show a broken
ionic lock. Two upfield shifted resonances are associated with active
states, S3 and S3′, as identified by binding of G-protein-derived
peptides (see below). In stark contrast to β2AR, the active states S3 and
S3′ are already present in the A2AR apo form and their populations are
Figure 1 | Ligand-dependent A2AR state equilibria. Three distinct
resonances of ¹⁹F-labelled A2AR-V229C are associated with inactive
(S1-2, shown in red) and active states (S3, shown in green, and S3′, shown
in blue), as a function of representative ligands. a, ¹⁹F NMR spectra of
the receptor in the apo form or in the presence of inverse agonist, partial
agonist, or full agonist, respectively. b, ¹⁹F NMR spectra of the receptor in
the apo form and with increasing amounts of NECA agonist. c, Histogram
obtained from spectral deconvolutions, comparing the relative populations
of S1-2, S3, and S3′ states. d, Histogram comparing the relative populations
of states S1-2, S3, and S3′ upon titration of the full agonist NECA to A2AR.
Experiments were replicated at least three times from separately expressed
and reconstituted samples. Details on the chemical shift referencing,
line shape fitting procedure, and error analyses are provided in the
Supplementary Information and Extended Data Figs 5 and 6.


increased by the addition of partial agonist or full agonist, respectively. 
Addition of ligand merely alters the distribution of states in a manner 
consistent with conformational selection. 

Figure 1 and Extended Data Fig. 6 show ¹⁹F NMR spectra of
2-bromo-N-(4-(trifluoromethyl)phenyl)acetamide (BTFMA)-labelled
A2AR-V229C as a function of representative ligands (inverse
agonist (ZM241385), partial agonist (LUF5834), and two full agonists
(UK432097 and NECA)). Three resonances associated with states S1-2,
S3, and S3′ can be identified in all of the spectra. Addition of inverse
agonist shifts the equilibrium towards the S1-2 ensemble. Addition of
the partial agonist LUF5834 stabilizes S3. The allosteric modulator
HMA has the same effect on S3 with the caveat that the resonance
associated with S1-2 appears to be exchange broadened (Extended Data
Fig. 7). Finally, full agonists (UK432097 or NECA) shift the equilibrium
towards S3′. The chemical shifts associated with S1-2, S3, and S3′ are
observed to be increasingly upfield, consistent with a corresponding
increase in solvent exposure of the probe and opening of the cytoplasmic
crevice via rotation and translation of TM5 and TM6. The
corresponding state populations are obtained directly from signal
deconvolutions (Fig. 1a, b) and are provided as histograms (Fig. 1c, d).
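
As a rough illustration of how relative state populations can follow from a deconvolution of this kind, the sketch below converts fitted peak parameters into fractional populations. It assumes that each resonance is a Lorentzian whose integrated area (π × height × half-width at half-maximum) is proportional to the population of the corresponding state. The peak heights and widths are hypothetical placeholders rather than values from this study, and the function names are illustrative only; the actual line shape fitting procedure is the one described in the Supplementary Information.

```python
import math


def lorentzian_area(height, hwhm):
    """Area under L(x) = height * hwhm**2 / ((x - x0)**2 + hwhm**2)."""
    return math.pi * height * hwhm


def state_populations(peaks):
    """Map {state: (height, hwhm)} to fractional populations from peak areas."""
    areas = {state: lorentzian_area(h, w) for state, (h, w) in peaks.items()}
    total = sum(areas.values())
    return {state: area / total for state, area in areas.items()}


# Hypothetical fitted parameters (arbitrary height units, HWHM in Hz);
# these numbers are assumptions for illustration, not measured data.
example_peaks = {"S1-2": (1.0, 60.0), "S3": (1.5, 40.0), "S3'": (0.5, 40.0)}
populations = state_populations(example_peaks)

for state, fraction in populations.items():
    print(f"{state}: {fraction:.2f}")
```

Because only area ratios matter, the result does not depend on the absolute intensity scale of the spectrum.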

While inactive states S1 and S2 undergo exchange on a low millisecond
timescale, exchange between the inactive state ensemble S1-2
and the active states S3 and S3′ is of the order of 1 or 2 s, as shown by
saturation transfer experiments (Fig. 2). In this case, the inactive ensemble
can be selectively saturated by application of a low power pulse applied
at a frequency, νs, slightly downfield from the resonance associated with
S1-2. By recording spectra as a function of the duration of the pulse, it
is possible to determine the rate of exchange between S3 and S1-2 or
equivalently the lifetime of the S3 intermediate state, τS3. Conversely,
the lifetime of the inactive ensemble, τS1-2, can be determined by
saturating the active states (S3 and S3′) as described in Extended Data
Fig. 8. Note that because of overlap between S3 and S3′, it is difficult to
measure their mutual exchange. The saturation transfer experiments
(Fig. 2) reveal that the S3 state is long-lived (1-3 s) for A2AR in the apo
form or when bound to either inverse agonist or partial agonist. The
addition of agonist (UK432097) appears to shorten the lifetime of the
S3 state, which may be a consequence of lowered barriers and hence faster
exchange between S3 and both the S1-2 and S3′ states. The saturation
transfer experiments are further consistent with a sequential transition
S3′ ↔ S3 ↔ S1-2 (Extended Data Fig. 8b, c).
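
To make the link between a saturation transfer decay and a state lifetime concrete, the sketch below uses the textbook two-site saturation transfer model, in which the observed signal decays as M(t)/M0 = R1/(R1 + k) + [k/(R1 + k)]·exp(−(R1 + k)t) while its exchange partner is saturated. This model is an assumption made here for illustration and is not necessarily the exact analysis applied to the data (see Extended Data Fig. 8). The relaxation rate R1 and exchange rate k below are hypothetical values, and the function names are illustrative.

```python
import math


def saturation_transfer_signal(t, r1, k):
    """Relative intensity M(t)/M0 of the observed state after its exchange
    partner has been saturated for t seconds (two-site model).

    r1: longitudinal relaxation rate of the observed state (1/s)
    k:  exchange rate out of the observed state (1/s); lifetime = 1/k
    """
    rate = r1 + k
    return r1 / rate + (k / rate) * math.exp(-rate * t)


def lifetime_from_plateau(plateau_ratio, r1):
    """Estimate the lifetime (s) from the plateau M_inf/M0 = r1/(r1 + k)."""
    k = r1 * (1.0 - plateau_ratio) / plateau_ratio
    return 1.0 / k


# Hypothetical parameters, chosen only to give a lifetime of 2 s.
r1_example = 1.0  # 1/s, assumed
k_example = 0.5   # 1/s, assumed

plateau = saturation_transfer_signal(100.0, r1_example, k_example)
lifetime = lifetime_from_plateau(plateau, r1_example)
print(f"plateau = {plateau:.3f}, lifetime = {lifetime:.2f} s")
```

In practice the full decay curve would be fitted rather than the plateau alone, and off-resonance saturation would need to be corrected with a control experiment.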

A sequence of GPCR states where the receptor becomes gradually
more active has been shown for the photoreceptor and GPCR rhodopsin.
According to this sequence of reaction steps, formation of
the fully active receptor state is concomitant with a proton uptake from
the aqueous environment to the conserved D(E)RY motif on TM3.
We therefore recorded pH-dependent ¹⁹F NMR spectra of BTFMA-labelled
A2AR-V229C in the apo form and in the presence of saturating
amounts of NECA agonist (Fig. 3). With decreasing pH, the population
of states shifted towards the S3′ state at the expense of S1-2 and S3,
as expected for a coupled equilibrium where the last transition from
S3 to S3′ is pH-dependent. The pH-dependent population of the S3′ state
was more pronounced in the presence of NECA agonist (Fig. 3a). An
analogy is seen with opsin (the apo form of rhodopsin), which is also
(UK432097; c). To account for off-resonant
saturation effects due to the pulse at a frequency,
νs, a control experiment was performed at a
control frequency equidistant from the peak of interest.
The effective decay curve (green dashed line)
represents the approximate response of S3
associated with selective saturation of S1-2.
Figure 3 | Effect of pH and Gαs-derived peptide on A2AR conformational
states. a, ¹⁹F NMR spectra of BTFMA-labelled A2AR-V229C at various pH
values for the A2AR apo form (left) and A2AR saturated with NECA agonist
(right). b, Native gel of BTFMA-labelled A2AR-V229C in the apo form or
in the presence of saturating amounts of partial agonist (LUF5834) or full
agonist (NECA), respectively (lanes 1-3). The presence of a Gαs-derived
peptide causes a mobility shift (lanes 5-7). c, ¹⁹F NMR spectra of
BTFMA-labelled A2AR-V229C in either the apo form or in the presence of
saturating amounts of partial agonist (LUF5834) or full agonist (NECA) in the
absence and presence of Gαs-derived peptide. Ligand and peptide concentrations
were 50× LUF5834, 100× NECA and 50× Gαs-derived peptide,
respectively, relative to the receptor concentration.
In Fig. 3b, c, we examine the effect of a peptide derived from
the carboxyl (C)-terminal domain of the G-protein Gαs
(RVFNDARDIIQRMHLRQYELL) on the equilibrium of A2AR states.
¹⁹F NMR data and mobility shifts in native gels showed that the peptide
is able to interact with the apo receptor and A2AR saturated with partial
agonist or full agonist. Addition of the peptide reduced the inactive
state ensemble population and shifted the equilibrium towards the
S3 and S3′ states, identifying both states as active as characterized by
their capability to interact with the Gαs-derived peptide. S3 and S3′
states have different conformations and thus may vary in their capacity
to activate G protein. In the presence of saturating amounts of full
agonist, addition of the Gαs peptide resulted in a pronounced shift towards
S3 and S3′, whereas in the presence of partial agonist the S3 intermediate
prevailed without population of S3′. The spectra thus demonstrate that
the Gαs peptide is able to bind either S3 or S3′ states in a manner
consistent with conformational selection. Moreover, a closer inspection
of the apo spectrum suggests that the peptide preferentially binds to
S3′ over S3, which is not the case in the presence of a partial agonist.
Rather, the partial agonist stabilizes the S3 state, and addition of
Gαs-derived peptide only reinforces this state. This probably directly
relates to a reduced efficiency of binding and activation of the holo G
protein when partial agonist stabilizes A2AR.

The activation process associated with GPCRs is probably best
understood in the case of visual rhodopsin with its covalently bound
chromophore 11-cis-retinal. Light absorption causes cis/trans
isomerization and thus in situ conversion of an inverse agonist into an
agonist. The fully active G-protein-interacting state develops sequentially
through a series of metarhodopsin states which are in equilibrium
and are stabilized by proton uptake. We find a remarkable similarity
for A2AR with inactive and active states, which find their counterparts
in the rhodopsin activation scheme as proposed earlier. The opsin
apo form exists in a pH-dependent conformational equilibrium and
retinal uptake is suggested to occur via conformational selection.

The current NMR data reaffirm the idea that key functional states
simultaneously exist within a dynamic and ‘loosely coupled’ ensemble
of the unliganded receptor, as depicted in Fig. 4. Inactive and active
states exchange slowly, as has been previously noted in studies of other
GPCRs. The corresponding high activation barriers probably


Figure 4 | Model of the free energy landscape
and corresponding model of A2A receptor
activation. a, The effects of inverse agonist,
partial agonist, and full agonist on the state
equilibria are illustrated in the free energy
landscapes. The functional states (S1-2, S3,
and S3′) are characterized as sitting in deep
free-energy wells, while undergoing relatively
slow exchange. Ligands affect this landscape
in a manner consistent with conformational
selection. b, Binding of Gαs to apo A2AR
is enabled through the active state ensemble.
play a key role in regulation of signalling. Despite these barriers, basal
activity of a receptor such as A2AR would be expected to occur owing
to the presence of S3 and, presumably to a lesser extent, S3′. An inverse
agonist shifts the equilibrium towards the inactive ensemble, S1-2, and
suppresses the basal population of active states.

The addition of partial agonist or full agonist further stabilizes the
respective active states, consistent with the notion of conformational
selection, while it is also clear that ligands influence barrier heights
associated with activation, as exemplified by the observation that HMA
resulted in faster exchange between S1-2 and S3 (Extended Data Fig. 7).
We note that 70% of the unliganded receptors adopt the active states,
S3 or S3′, in contrast to β2AR, where the receptor was biased towards
the inactive ensemble and active states could only be detected with
agonist and/or nanobody. It may be that, in the absence of agonists,
the active states associated with β2AR are either very weakly populated
or are short-lived and therefore exchange-broadened to the point where
they cannot yet be detected. The coexistence of S3 and S3′ in A2AR also
underlies the concept of partial agonism, which refers to a
pharmacological phenomenon where the addition of saturating amounts of
a given agonist results in sub-maximal signalling or efficacy. There
are two schools of thought as to how partial agonism might originate
at a molecular level. Quite simply, a partial agonist may establish an
unstable (short-lived) fully active state. In this case, the peak
associated with a partial agonist would be represented by a weighted average
between resonances associated with the fully active and other active and
inactive states and might also exhibit exchange broadening. The second
possibility is that a partial agonist sub-optimally engages the
orthosteric binding site such that the active conformation is simply not fully
established (that is, S3 in A2AR). The two distinct frequencies, which
prevail for any ligand tested, imply the existence of two inherently stable
states, whose relative populations are determined by the ligand and/or
environment. The addition of partial agonist would result in a ‘less
open’ A2AR conformation, leading to weaker allosteric coupling with
the G protein, than that attained through the ‘more open’ S3′ state. This
notion that the conformation of the partial agonist stabilized state is not
fully competent to engage with the G protein is reminiscent of a recent
description of partial agonism in terms of the triggering of multiple
switches in the receptor.
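
The statement that 70% of unliganded receptors occupy the active states can be expressed as an apparent free-energy difference using the standard relation ΔG = −RT ln(p_active/p_inactive). The sketch below is a minimal illustration under stated assumptions: the inactive fraction is taken as the remaining 30%, and the temperature of 298.15 K is an assumed value, not one taken from the experiments. The function name is illustrative.

```python
import math

GAS_CONSTANT = 8.314462618  # J mol^-1 K^-1


def delta_g(p_state, p_reference, temperature_k):
    """Free energy (J/mol) of a state relative to a reference state,
    from equilibrium populations: dG = -RT ln(p_state / p_reference)."""
    return -GAS_CONSTANT * temperature_k * math.log(p_state / p_reference)


# 70% active (S3 + S3') versus 30% inactive (S1-2), as inferred from the text.
# The temperature is an assumption made for this example only.
dg_active = delta_g(0.70, 0.30, 298.15)
print(f"dG(active - inactive) = {dg_active / 1000:.2f} kJ/mol")
```

Under these assumptions the active ensemble is favoured by roughly 2.1 kJ/mol, less than the thermal energy RT (about 2.5 kJ/mol at this temperature). This is consistent with ligands being able to redistribute the populations substantially.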

We note that V229C was selected as a labelling site, as discussed in 
the Methods, to discriminate between active and inactive states, identi- 
fied by crystallography, while minimizing expression losses, misfolding, 
or loss of function. A survey of other domains may reveal additional 
nuances associated with the activation process. We have been able to 
demonstrate that ligand binding to A2AR occurs through conformational
selection rather than induced fit. This has important ramifications
for drug design as it implies that any therapeutic compound
would ideally favour a pre-existing receptor state. Accordingly, choice 
of the interacting state would then dictate pharmacology. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
METHODS


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. The investigators were not blinded to allocation during 
experiments and outcome assessment. 

Plasmid construction and transformation. The full-length human A2AR
gene, originating from construct pPIC9K_ADORA2A, was provided by T.
Kobayashi. The construct M-TEV-hA2aRTr316-H10 (with a TEV protease
cleavage site insert) was engineered from the construct M-hA2aARTr316-H10,
which was provided by R. Grisshammer. A gene fragment,
Fα-Factor-Flag-TEV-A2aARTr316-H10, with components derived either from
pPIC9K_ADORA2A or from M-TEV-hA2aRTr316-H10 was amplified by fusion PCR
with primers listed in Extended Data Table 1. pPIC9K_ADORA2A and
Fα-Factor-Flag-TEV-A2aARTr316-H10 were digested with BamHI-HF and NotI-HF
(New England Biolabs) restriction enzymes. The isolated
Fα-Factor-Flag-TEV-A2aARTr316-H10 fragment was subcloned into the pPIC9K
plasmid to generate the new plasmid
pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10. The construct
pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10_V229C containing the V229C
mutation was generated using a QuikChange Lightning Site-Directed
Mutagenesis Kit (Agilent Technologies) with primers listed in Extended Data
Table 1. All constructs were sequenced by a local DNA sequencing facility
(The Centre for Applied Genomics, Sick Kids Hospital, Toronto, Canada) with
the AOX1 primer pair of PF_AOX1 and PR_AOX1 listed in Extended Data Table 1.
The proteins resulting from plasmids
pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10 and
pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10_V229C were designated as A2AR
(Pro2 to Ala317) and A2AR-V229C (Pro2 to Ala317 including the V229C
mutation), respectively, on the basis of their correspondence to the
wild-type sequence. Freshly prepared competent cells of a strain of Pichia
pastoris SMD 1163 (Δhis4 Δpep4 Δprb1, Invitrogen) were electro-transformed
with PmeI-HF (New England Biolabs) linearized plasmids using a Gene Pulser II
(Bio-Rad). High-copy clone selection was performed as previously described,
and a high-yield construct was then screened by an immunoblotting assay for
further expression.

Receptor expression, purification and labelling. A pre-cultured single colony
on YPD (1% (w/v) yeast extract, 2% (w/v) peptone and 2% (w/v) glucose) plates
containing 0.1 mg ml−1 G418 was inoculated into 4 ml YPD medium and cultured
at 30 °C for 12 h, then transferred into 200 ml BMGY medium (1% (w/v) yeast
extract, 2% (w/v) peptone, 1.34% (w/v) YNB (yeast nitrogen base) without amino
acids, 0.00004% (w/v) biotin, 1% (w/v) glycerol, 0.1 M PB (phosphate buffer) at
pH 6.5) and cultured at 30 °C for another 24 h with an absorbance A595nm in the
range 2-6. The cell pellets were spun down at 4,000g for 5 min and were then
resuspended in 1 l of BMMY medium (1% (w/v) yeast extract, 2% (w/v) peptone, 1.34%
(w/v) YNB without amino acids, 0.00004% (w/v) biotin, 0.5% (w/v) methanol,
0.1 M phosphate buffer at pH 6.5, 0.04% (w/v) histidine and 3% (v/v) DMSO,
10 mM theophylline) at 20 °C. Methanol (0.5% (v/v)) was added every 18 h. Sixty
hours after induction by methanol, cells were harvested for purification.

The cell pellets were collected by centrifugation at 4,000g for 10 min, and washed
twice with washing buffer (50 mM HEPES, 10% glycerol, pH 7.4) before addition of
breaking buffer (50 mM HEPES, pH 7.4, 100 mM NaCl, 2 mM EDTA, 10% glycerol,
100 U Zymolyase, 100 µM theophylline). The sample was kept at room temperature
(20 °C) for 1 h before disruption by vortexing for 2 h at 4 °C. Intact cells and cell
debris were separated from the membrane suspension by low speed centrifugation
(8,000g) for 30 min. The supernatant was collected and centrifuged at 100,000g
for 1 h, and the precipitated cell membrane was then immediately dissolved
in membrane lysis buffer (50 mM HEPES, pH 7.4, 100 mM NaCl, 1% MNG-3
(lauryl maltose neopentyl glycol) and 0.2% CHS (cholesteryl hemisuccinate), 100 µM
theophylline, and 20 mM imidazole) under continuous agitation for 1-2 h at 4 °C
until the membrane was dissolved. Subsequently, Talon resin (Clontech) was added
to the solubilized membranes and incubated for at least 2 h or overnight under
gentle agitation.

The A2AR-bound Talon resin was washed twice with 50 mM HEPES buffer,
pH 7.4, containing 100 mM NaCl, 0.1% MNG-3, and 0.02% CHS and resuspended
in the same buffer, followed by addition of 100 µM TCEP reducing
agent and incubation for 20 min. TCEP was washed out immediately by two
rinsing steps with a buffer made of 50 mM HEPES, pH 7.4, 100 mM NaCl,
0.1% MNG-3, and 0.02% CHS. The A2AR-bound Talon resin was then resuspended
in buffer made of 50 mM HEPES, pH 7.4, 100 mM NaCl, 0.1% MNG-3,
and 0.02% CHS, and combined with 10- to 20-fold excess of the NMR label
(2-bromo-N-(4-(trifluoromethyl)phenyl)acetamide, BTFMA) or EPR
label (3-(2-iodoacetamido)-PROXYL) in the presence of nitrogen and under
gentle agitation overnight at 4 °C. At the same time, 20 µl of TEV enzyme was
added to remove the A2AR amino (N)-terminal tag. Another aliquot of NMR
label was then added and incubated for an additional 6 h to ensure complete
labelling. After the labelling and removal of the N-terminal tag was complete,
the A2AR-bound Talon resin was extensively washed in a disposable column
with buffer containing 50 mM HEPES, pH 7.4, 100 mM NaCl, 0.1% MNG-3,
and 0.02% CHS, and apo A2AR was then eluted from the Talon resin with 50 mM
HEPES, pH 7.4, 100 mM NaCl, 0.1% MNG-3, 0.02% CHS and 250 mM imidazole,
and concentrated to a volume of 5 ml. NaCl and imidazole in the sample were
then removed by dialysis against 100 ml of 50 mM HEPES, pH 7.4, 0.1% MNG-3,
0.02% CHS for 3 h. The XAC-agarose gel (antagonist xanthine amine congener
(XAC) conjugated to Affi-Gel 10 resin) and A2AR were then incubated together
for 2 h under gentle agitation. Functional A2AR was eluted with 50 mM HEPES,
pH 7.4, 0.1% MNG-3, 0.02% CHS, 100 mM NaCl, 20 mM theophylline. The eluted
samples were concentrated to 20 ml by centrifugal filtration (molecular weight
cut-off 3.5 kDa), and Talon resin was added and incubated under gentle agitation
for another 2 h to bind functional A2AR. Functional A2AR bound to the resin was
washed extensively with buffer containing 50 mM HEPES, pH 7.4, 100 mM NaCl,
0.1% MNG-3, 0.02% CHS, and 20 mM imidazole, to remove all theophylline.
Then, functional apo A2AR was eluted with elution buffer (50 mM HEPES, pH 7.4,
100 mM NaCl, 0.1% MNG-3, 0.02% CHS and 250 mM imidazole) and the sample
was dialysed to remove imidazole and concentrated for NMR or EPR.

Choosing labelling sites based on X-ray crystal structures. PROSHIFT software
(http://www.meilerlab.org/index.php/servers/show?s_id=9) was used to predict
Cα chemical shift differences between the crystal structures of the active
NECA-bound state (Protein Data Bank (PDB) accession number 2YDV) and the inactive
ZM241385-bound state (PDB accession number 3EML). For the present study, we 
focused on differences associated with transmembrane domains TM3, TM5, and 
TM6, in an effort to characterize different active states. 

Double electron-electron resonance and continuous wave EPR experiments. 
Site V229C^6.31 was spin-labelled with 3-(2-iodoacetamido)-PROXYL to generate
a paramagnetic nitroxide side chain. X-band continuous wave (CW)-EPR data of
the spin-labelled apo A2AR were acquired using a Bruker ELEXSYS E500 CW-EPR
spectrometer coupled to an ER 4123D dielectric resonator. The field sweep for data
collection was 100 G and the modulation amplitude was 2 G. Data sets were typically
averages of 30-50 scans. Double electron-electron resonance (DEER) spectroscopy
was used to verify that only a single EPR active spin label was attached to A2AR. The
same sample as in the continuous wave experiment was used to collect Q-band
DEER data using a Bruker ELEXSYS E580 spectrometer. Data were analysed using 
the program ‘LongDistances’ developed by C. Altenbach, which is available for 
download at http://www.biochemistry.ucla.edu/biochem/Faculty/Hubbell. 

NMR experiments. NMR samples typically consisted of 250 µl volumes of 50-200 µM
19F-labelled A2AR-V229C in 50 mM HEPES buffer and 100 mM NaCl, doped with
10% D2O. The receptor was stabilized in 0.1% MNG-3 and 0.02% CHS. All 19F
NMR experiments were performed on a 600 MHz Varian Inova spectrometer
using a cryogenic triple resonance probe, with the high-frequency channel tuneable
either to 1H or to 19F. Typical experimental setup included a 23 µs 90° excitation
pulse, an acquisition time of 200 ms, a spectral width of 15 kHz, and a repetition
time of 1 s. Most spectra were acquired with 15,000 scans, which provided a
signal-to-noise ratio of roughly 100. Processing typically involved zero filling and
exponential apodization equivalent to 15 Hz line broadening.
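
The processing steps named above (zero filling and exponential apodization equivalent to 15 Hz line broadening) can be sketched as follows. This is an illustration only and not the processing script used for the paper: the function names, the synthetic free induction decay (FID) and the choice to zero-fill to twice the acquired length are assumptions. The dwell time follows from the stated 15 kHz spectral width, and the window exp(−π·LB·t) is the standard form that adds LB hertz to a Lorentzian line width.

```python
import cmath
import math

SPECTRAL_WIDTH_HZ = 15_000.0   # from the text
ACQUISITION_TIME_S = 0.200     # from the text
LINE_BROADENING_HZ = 15.0      # from the text


def apodize(fid, dwell_s, lb_hz):
    """Multiply point i by exp(-pi * lb_hz * t_i), with t_i = i * dwell_s."""
    return [p * math.exp(-math.pi * lb_hz * i * dwell_s) for i, p in enumerate(fid)]


def zero_fill(fid, factor=2):
    """Append zeros so the length becomes factor times the original (assumed factor)."""
    return list(fid) + [0j] * (len(fid) * (factor - 1))


dwell = 1.0 / SPECTRAL_WIDTH_HZ                            # complex sampling interval
n_points = round(ACQUISITION_TIME_S * SPECTRAL_WIDTH_HZ)   # number of complex points

# Synthetic single-line FID: hypothetical 500 Hz offset and 100 Hz natural line width.
offset_hz, natural_width_hz = 500.0, 100.0
fid = [
    cmath.exp((2j * math.pi * offset_hz - math.pi * natural_width_hz) * i * dwell)
    for i in range(n_points)
]

processed = zero_fill(apodize(fid, dwell, LINE_BROADENING_HZ))
```

After this step the decay envelope of the synthetic signal corresponds to a 115 Hz line (100 Hz natural width plus 15 Hz broadening); a Fourier transform of `processed` would then give the spectrum.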

T2 measurements. 19F-labelled apo A2AR-V229C (200 µM; comprising amino acids
2-317) in the buffer as described above was used for measurements of transverse
relaxation time (T2) by a CPMG T2 pulse sequence, using a refocusing period of
133 µs, with a total transverse magnetization evolution time of 0.4, 0.8, 1.2, 1.6,
2.0, 2.4, 2.8, 3.2, and 3.6 ms.
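
A T2 value can be extracted from such a relaxation series by fitting peak intensity against evolution time. The sketch below assumes a single-exponential decay, I(t) = I0·exp(−t/T2), and uses a log-linear least-squares fit for brevity. The evolution times are those listed above; the intensities are synthetic, and the function names are hypothetical. The homogeneous line width follows from the standard relation Δν = 1/(π·T2).

```python
import math

# Total transverse evolution times from the text, converted to seconds.
EVOLUTION_TIMES_S = [t * 1e-3 for t in (0.4, 0.8, 1.2, 1.6, 2.0, 2.4, 2.8, 3.2, 3.6)]


def fit_t2(times_s, intensities):
    """Fit I(t) = I0 * exp(-t / T2) by linear regression on ln(I); return (I0, T2)."""
    n = len(times_s)
    logs = [math.log(v) for v in intensities]
    mean_t = sum(times_s) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times_s)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times_s, logs))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_t
    return math.exp(intercept), -1.0 / slope


def homogeneous_linewidth_hz(t2_s):
    """Lorentzian full width at half maximum implied by T2."""
    return 1.0 / (math.pi * t2_s)


# Synthetic, noise-free intensities for a hypothetical T2 of 1.5 ms.
synthetic = [100.0 * math.exp(-t / 1.5e-3) for t in EVOLUTION_TIMES_S]
i0, t2 = fit_t2(EVOLUTION_TIMES_S, synthetic)
width_hz = homogeneous_linewidth_hz(t2)
```

With noisy data a weighted or nonlinear fit would be preferable, because taking logarithms inflates the influence of the weakest points.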

19F saturation transfer experiments. To investigate slow exchange between
resolved states, 19F chemical exchange saturation transfer NMR experiments were
performed, in which a series of continuous-wave irradiation pulses were applied
both at an on-resonance frequency (ν_s) and at an off-resonance frequency (ν_c), to
assess chemical exchange during steady-state saturation and off-resonant
saturation effects, as shown in Fig. 2. Upon saturating the resonance associated with
state B, the ideal magnetization response of A may be described by the formula

$$M_A = M_A^0\left\{\frac{k_{AB}}{\rho_A + k_{AB}}\exp\left[-T\left(\rho_A + k_{AB}\right)\right] + \frac{\rho_A}{\rho_A + k_{AB}}\right\},$$

assuming off-resonant effects are accounted for. Note that both the exchange rate
constant, k_AB, and the longitudinal relaxation rate of spin A, ρ_A, can in principle be
calculated from a fit of the above equation to the experimental data. Accordingly,
the lifetime τ_A can be calculated from τ_A = 1/k_AB. All fits were performed using
Gnuplot (http://www.gnuplot.info).
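
As a numerical illustration of the two-site saturation transfer expression, M_A(T) = M_A^0 {k_AB/(ρ_A + k_AB)·exp[−T(ρ_A + k_AB)] + ρ_A/(ρ_A + k_AB)}, the sketch below evaluates the response and the lifetime τ_A = 1/k_AB. It takes T to be the saturation duration, which is an assumption about the symbol. The value k_AB = 0.62 s−1 is the forward rate constant quoted in Extended Data Fig. 8; ρ_A = 1.0 s−1 and the function names are hypothetical.

```python
import math


def saturation_response(t_sat_s, m0, k_ab, rho_a):
    """Magnetization of state A after saturating state B for t_sat_s seconds."""
    total = rho_a + k_ab
    return m0 * (k_ab / total * math.exp(-t_sat_s * total) + rho_a / total)


def lifetime(k_ab):
    """Lifetime of state A, tau_A = 1 / k_AB."""
    return 1.0 / k_ab


K_AB = 0.62   # s^-1, quoted in Extended Data Fig. 8
RHO_A = 1.0   # s^-1, hypothetical longitudinal relaxation rate

start = saturation_response(0.0, 1.0, K_AB, RHO_A)     # no saturation: full signal
plateau = saturation_response(1e3, 1.0, K_AB, RHO_A)   # long saturation: steady state
tau_a = lifetime(K_AB)
```

The curve starts at M_A^0 and decays to the steady-state fraction ρ_A/(ρ_A + k_AB), so the plateau depth and the decay rate together determine k_AB and ρ_A.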
Extended Data Figure 1 | Comparison of inactive and active GPCR
crystal structures. a, Inactive state A2AR (cyan, inverse agonist ZM241385
bound, PDB accession number 4EIY) and active state A2AR (brown,
agonist UK432097 bound, PDB accession number 3QAK). b, Inactive
state β2AR (green, inverse agonist carazolol bound, PDB accession number
2RH1) and active state β2AR (red, agonist (8-[(1R)-2-{[1,1-dimethyl-2-
(2-methylphenyl)ethyl]amino}-1-hydroxyethyl]-5-hydroxy-2H-1,4-
benzoxazin-3(4H)-one) bound, PDB accession number 3SN6). c, Inactive
rhodopsin (purple, inverse agonist 11-cis-retinal bound, PDB accession
number 1U19) and active metarhodopsin II (blue, agonist all-trans-retinal
bound, PDB accession number 3PQR).
Extended Data Figure 2 | Secondary structure and topology of
C-terminally truncated A2AR-V229C. Residues 2-317 of A2AR are
preceded by an Ala residue resulting from TEV protease cleavage, and are
succeeded by an Ala–His10 sequence. A2AR was expressed in P. pastoris
(SMD1163 strain) through genomic integration of a pPIC9K vector with
a leader sequence consisting of α-Factor, Flag tag (DYKDDDDK), and a
TEV protease recognition domain (SNNNNNNNNNNLGENLYFQGA).
During the secretion process, the signal peptide of the α-Factor gets
cleaved and the domain associated with the Flag tag and TEV recognition
domain is removed by TEV protease. a, The truncated wild-type receptor
used in this study contains all four native disulfide bonds and six buried
cysteine residues (indicated in red), none of which were perturbed by the
labelling process, which was specific for the introduced (solvent-exposed)
cysteine residue V229C^6.31 (shown in green with yellow background; the
superscript refers to the Ballesteros–Weinstein numbering). b, A surface
map suggests V229C (green, solvent exposed) should be fully labelled
without perturbing the receptor. c, Structures of protein-attached labels for
NMR (BTFMA; 2-bromo-N-(4-(trifluoromethyl)phenyl)acetamide) and
EPR (PROXYL; 3-(2-iodoacetamido)-PROXYL) analysis. d, e, Location
and topology of the labelling site associated with V229C for both the
inverse agonist (inactive, grey) and agonist-bound (active, yellow) states
(PDB accession numbers 4EIY and 3QAK). Two rotamers of the BTFMA
label are indicated in green and purple (the phenyl moiety is shown as a
sphere). Note that the size of the tags is slightly larger than that depicted
in the figure. The environment around the tag is predicted to differ for
inactive and active states of the receptor.
Extended Data Figure 3 | Labelling efficiency of A2AR-V229C. a, Single
cysteine CW-EPR spectrum of 50 µM apo A2AR-V229C receptor, labelled
with a PROXYL spin-label and reconstituted into MNG-3 detergent
micelles. b, DEER measurement of 50 µM PROXYL spin-labelled apo
A2AR-V229C receptor. c, 19F NMR spectra of protease-K-digested
19F-labelled A2AR-V229C, showing one dominant peak.
Extended Data Figure 4 | Carr–Purcell–Meiboom–Gill (CPMG)
relaxation dispersion experiment to evaluate dynamics of S1–2.
a, 19F NMR CPMG relaxation series of 19F-labelled apo A2AR-V229C.
Each spectrum was acquired using 10,000 scans with a constant T2-refocusing
period of 3.5 ms. The spectra in the relaxation series were recorded with
different refocusing frequencies (that is, different periods between the
refocusing pulses as indicated above, representative of three experiments).
The sample consisted of 200 µM 19F-labelled apo A2AR-V229C in 50 mM
HEPES buffer (pH 7.4) and 100 mM NaCl. b, CPMG curve for the S1–2
peak (red diamonds) and reference peak (black triangles). S1–2 undergoes
millisecond timescale exchange while the reference peak exhibits no
dispersion. c, Cartoon illustrating S1 and S2 exchange in addition to the
activation intermediates.
Extended Data Figure 5 | Comparison of two- and three-state models
of 19F-labelled A2AR-V229C. a, 19F NMR T2 relaxation series of the
19F-labelled apo A2AR-V229C receptor. b, Exponential fit to T2 for the
downfield and upfield resonances, A and B in a. c, Deconvolution of the
19F NMR spectrum for 19F-labelled apo A2AR-V229C receptor assuming
a two-state model. The fitted line width of the upfield resonance is roughly
twice that estimated from the T2 measurement, suggesting the upfield
resonance may be better represented as a superposition of two Lorentzian
lines, associated with S3 and S3′, as discussed in the Supplementary
Information. d, Spectral deconvolution of the 19F NMR spectrum of the
19F-labelled apo A2AR-V229C receptor assuming three states. Note that
the most downfield peak is ascribed to S1–2, which results from the rapid
flickering of the ionic lock from ‘on’ (S1) to ‘off’ (S2), as evidenced by the
CPMG measurements in Extended Data Fig. 4. Thus, we propose a total
of four states, three of which may be spectroscopically resolved. The
resonance frequencies chosen in the fit for S3 and S3′ were based on the
observed peaks seen in the presence of agonists and those identified at
pH 6, where S3 and S3′ are better resolved. The fitted line widths are also
comparable to the homogeneous line widths, estimated from the above
T2 experiment. Note that the difference spectrum (that is, the experimental
spectrum minus spectral deconvolution) associated with the fit is shown in
blue in c and d.
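
The deconvolution described in this caption models the spectrum as a sum of Lorentzian lines and inspects the difference spectrum. The sketch below shows that model under stated assumptions and is not the authors' fitting code. The chemical shifts and line widths are those reported in Extended Data Fig. 6; the 19F Larmor frequency of about 564.6 MHz (for a 600 MHz 1H spectrometer), the relative areas and all function names are assumptions for illustration.

```python
import math

F19_MHZ = 564.6  # assumed 19F frequency on a 600 MHz (1H) instrument; Hz per ppm


def lorentzian(nu_hz, center_hz, fwhm_hz, area=1.0):
    """Absorption-mode Lorentzian with the given full width at half maximum."""
    half = fwhm_hz / 2.0
    return area * half / (math.pi * ((nu_hz - center_hz) ** 2 + half ** 2))


def model_spectrum(nu_hz, components):
    """Sum of Lorentzians; each component is (center_hz, fwhm_hz, area)."""
    return sum(lorentzian(nu_hz, c, w, a) for c, w, a in components)


def difference_spectrum(axis_hz, observed, components):
    """Observed minus model at each frequency point."""
    return [obs - model_spectrum(nu, components) for nu, obs in zip(axis_hz, observed)]


def populations(components):
    """Fractional populations taken as relative Lorentzian areas."""
    total = sum(a for _, _, a in components)
    return [a / total for _, _, a in components]


# Shifts (ppm) and widths (Hz) from Extended Data Fig. 6; areas are hypothetical.
components = [
    (-61.08 * F19_MHZ, 220.0, 0.3),   # S1-2
    (-61.60 * F19_MHZ, 230.0, 0.5),   # S3
    (-61.85 * F19_MHZ, 260.0, 0.2),   # S3'
]

axis_hz = [-62.5 * F19_MHZ + 5.0 * i for i in range(240)]
observed = [model_spectrum(nu, components) for nu in axis_hz]   # stand-in for data
residual = difference_spectrum(axis_hz, observed, components)
fractions = populations(components)
```

Here the "observed" spectrum is generated from the model itself, so the residual is zero; with real data, structure in the residual indicates that the assumed number of states or the line shapes are inadequate.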
Extended Data Figure 6 | 19F NMR spectra of 19F-labelled A2AR-V229C
in the presence of 50- or 100-fold excess of different ligands.
Representative (N = 3) 19F NMR spectra as a function of ligands
(inverse agonist (ZM241385), partial agonist (LUF5834), and full agonists
UK432097), as shown in Fig. 1a. The downfield peak represents a reference
peak resulting from the addition of 10 µM bendroflumethiazide. Note
that a difference spectrum (shown in dark blue), corresponding to the
difference between the sum of the three deconvolved resonances and the
observed spectrum, is shown in each case. Note that the chemical shifts
in the deconvolutions were referenced to the standard (−59.050 ppm)
and estimated to be −61.08 ppm (S1–2 (red)), −61.60 ppm (S3 (green)),
and −61.85 ppm (S3′ (blue)), respectively. Corresponding line widths were
estimated to be 220 Hz, 230 Hz, and 260 Hz, respectively.
Extended Data Figure 7 | The role of HMA in the receptor activation
process. a, 19F NMR spectra of 19F-labelled apo A2AR-V229C and
19F-labelled A2AR-V229C in the presence of saturating amounts of the
amiloride ligand 5-(N,N-hexamethylene) amiloride (HMA). Addition
of 50-fold excess of HMA results in an increase in the S3 fraction and an
apparent exchange broadening and slight coalescence of S1–2 and S3, which
are represented by the deconvolutions in lavender and green, respectively.
After accounting for the exchange process between S1–2 and S3 by
assuming k_ex = 600 Hz, the simulated spectrum (shown in red) compares
favourably with the observed spectrum. If we assume that exchange
between S1–2 is slow, we then obtain the ‘rigid’ lattice spectrum, shown
in black. b-d, 19F NMR spectra of 19F-labelled A2AR-V229C showing the
effect of the addition of 50-fold excess of HMA to saturating amounts
of inverse agonist (100 × ZM241385) and agonist (50 × UK432097 or
100 × NECA). In all cases, addition of HMA competes with the bound
ligand and establishes a greater fraction of the S3 state. The three
deconvolved resonances are shown in red, green, and blue.
Extended Data Figure 8 | Saturation transfer experiments of
19F-labelled A2AR-V229C. a, 19F NMR spectra of 19F-labelled apo
A2AR-V229C with corresponding decay curves associated with continuous wave
saturation of either the active state ensemble, S3 + S3′, or the inactive state
ensemble, S1–2, are provided in the left and right columns, respectively.
To account for off-resonant saturation effects, a control experiment was
performed at a frequency, ν_c, such that the peak of interest was equidistant
to the saturation frequency, ν_s, and the control frequency, ν_c. The response
of the peak of interest (that is, S1–2 and S3 + S3′ in the left and right panels,
respectively) to saturation at the control frequency, ν_c, is represented by
black squares. Similarly, the response of the peak of interest to saturation
at ν_s is shown in violet while the effective responses, accounting for
off-resonant saturation, are shown in red (S1–2) and green (S3 + S3′).
On the basis of the effective decay profiles, and using a two-site
exchange model, the lifetimes of the inactive state ensemble and active
states are estimated to be 1.6 s and 9 s, respectively. Spectral deconvolutions
allow us to estimate the populations, p(S1–2) and p(S3 + S3′), to be 0.28 and 0.72,
respectively. Using the fitted forward rate constant, k_AB = 0.62 s−1,
the reverse rate constant is estimated to be k_BA = 0.24 s−1, assuming
k_AB × p(S1–2) = k_BA × p(S3 + S3′). In contrast, the response to the saturation
of S1–2 provided an estimate of k_BA = 0.11 ± 0.03 s−1. b, Saturation transfer
experiments of full agonist UK432097-bound 19F-labelled A2AR-V229C.
The effective decay curve (blue dashed line), associated with saturation of
S1–2, is consistent with a process where S3′ magnetization is exchanged with
S1–2 via S3, as suggested by the figure in c. c, Model for presumed exchange
pathway between S1–2, S3, and S3′.
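
The detailed-balance estimate quoted in this caption can be checked with a few lines of arithmetic. The sketch below uses only the numbers given in the caption (forward rate constant 0.62 s−1, populations 0.28 and 0.72, directly measured reverse rate constant 0.11 s−1); the function and variable names are hypothetical.

```python
def reverse_rate(k_forward, p_from, p_to):
    """Detailed balance for two-site exchange: k_forward * p_from == k_reverse * p_to."""
    return k_forward * p_from / p_to


K_AB = 0.62         # s^-1, fitted forward rate constant (inactive -> active)
P_INACTIVE = 0.28   # p(S1-2) from spectral deconvolution
P_ACTIVE = 0.72     # p(S3 + S3') from spectral deconvolution
K_BA_DIRECT = 0.11  # s^-1, from saturating S1-2 directly

k_ba_balance = reverse_rate(K_AB, P_INACTIVE, P_ACTIVE)

tau_inactive = 1.0 / K_AB                # lifetime of the inactive ensemble
tau_active_balance = 1.0 / k_ba_balance  # active lifetime implied by detailed balance
tau_active_direct = 1.0 / K_BA_DIRECT    # active lifetime from the direct estimate
```

The 1.6 s lifetime corresponds to 1/k_AB, and the 9 s lifetime appears to correspond to the reciprocal of the directly measured 0.11 s−1 rather than to the detailed-balance value of about 0.24 s−1.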
Extended Data Table 1 | Primers/gene fragments used to construct plasmids for this study

Primer/fragment | Sequence | Construct
TEV fragment | 5′-TCTAACAACAACAACAACAACAACAACAACAACAACCTTGGCGAAAACTTGTATTTCCAGGGCGCT-3′ | pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10
PicP1-J | 5′-ATTCGAAGGATCCAAACGATGAGATTTC-3′ (BamHI) | pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10
PicP2-J | 5′-GTTGTTGTTGTTGTTAGACTTATCGTCATCGTCCTTGTAGTCTC-3′ | pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10
PicP3-H | 5′-GGACGATGACGATAAGTCTAACAACAACAACAACAACAACAACAACAAC-3′ | pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10
PicP41-H | 5′-TGCCTTGAAAGGTTCTTGCTGCC-3′ | pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10
PicP4-H | 5′-ATTCGCGGCCGCTCAGTGATGGTGATGGTGATGGTGATGGTGATGTGCCTTGAAAGGTTCTTGCTGCC-3′ (NotI) | pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10
P_A2aR_V229C | 5′-CCACACTGCAGAAGGAGTGCCATGCTGCCAAGTCAC-3′ | pPIC9K_Fα-Factor-Flag-TEV-A2aARTr316-H10_V229C
PF_AOX1 | 5′-GACTGGTTCCAATTGACAAGC-3′ | sequencing
PR_AOX1 | 5′-GGCAAATGGCATTCTGACATCCT-3′ | sequencing


© 2016 Macmillan Publishers Limited. All rights reserved 


LETTER 


doi:10.1038/nature17656 


Architecture of the mitochondrial calcium uniporter


Kirill Oxenoid, Ying Dong, Chan Cao, Tanxing Cui, Yasemin Sancak, Andrew L. Markhard, Zenon Grabarek,
Liangliang Kong, Zhijun Liu, Bo Ouyang, Yao Cong, Vamsi K. Mootha & James J. Chou


Mitochondria from many eukaryotic clades take up large amounts 
of calcium (Ca2+) via an inner membrane transporter called the
uniporter. Transport by the uniporter is membrane potential 
dependent and sensitive to ruthenium red or its derivative 
Ru360 (ref. 1). Electrophysiological studies have shown that the 
uniporter is an ion channel with remarkably high conductance and 
selectivity. Ca2+ entry into mitochondria is also known to activate
the tricarboxylic acid cycle and seems to be crucial for matching the
production of ATP in mitochondria with its cytosolic demand.
Mitochondrial calcium uniporter (MCU) is the pore-forming
and Ca2+-conducting subunit of the uniporter holocomplex,
but its primary sequence does not resemble any calcium channel 
studied to date. Here we report the structure of the pore domain 
of MCU from Caenorhabditis elegans, determined using nuclear 
magnetic resonance (NMR) and electron microscopy (EM). MCU is 
a homo-oligomer in which the second transmembrane helix forms 
a hydrophilic pore across the membrane. The channel assembly 
represents a new solution of ion channel architecture, and is 
stabilized by a coiled-coil motif protruding into the mitochondrial 
matrix. The critical DXXE motif forms the pore entrance, which 
features two carboxylate rings; based on the ring dimensions and 
functional mutagenesis, these rings appear to form the selectivity 
filter. To our knowledge, this is one of the largest membrane 
protein structures characterized by NMR, and provides a structural 
blueprint for understanding the function of this channel. 
Recently, genomic approaches have revealed the full molecular 
machinery of the uniporter holocomplex (uniplex). In vertebrates,
this complex consists of the transmembrane (TM) domain-containing
protein MCU, its inactive paralogue MCUb, and an accessory single-pass
TM peptide called EMRE. In addition, the complex includes
two paralogous, EF-hand Ca2+-binding proteins MICU1 and MICU2
in the intermembrane space. Current models of the uniporter indicate
that MCU is the pore-forming subunit, and that MICU1/2 are Ca2+-sensing
proteins that gate the activity of the pore based on cytosolic
Ca2+ concentrations. EMRE is metazoan specific and appears to have
two key functions: it maintains the pore in an open conformation,
while additionally transducing MICU1/2 Ca2+ sensing to the pore.
There is consensus now based on several lines of evidence that
MCU encodes the pore-forming subunit. First, loss of MCU leads
to complete abrogation of uniporter current. Second, expression
of the MCU orthologue from Dictyostelium, an organism that does
not have EMRE, is alone sufficient to reconstitute uniporter activity
in yeast. Third, MCU has conserved acidic residues in the DXXE
sequence motif at the putative entrance of the channel that are essential
for Ca2+ uptake. Fourth, MCU monomers oligomerize into a higher
molecular mass assembly, as would be required of a pore. Fifth, a
point mutation in MCU confers resistance to Ru360 while preserving
uniporter current, providing compelling biochemical evidence that
MCU is the probable mechanistic target of Ru360. At present,
information on the architecture of this pore protein is lacking. For example,
we do not know what its oligomeric state is, which TM helix forms the
pore, or what the structural basis of channel regulation is.

To determine the structure of the pore domain of MCU, we used an
approach that combines EM and NMR. MCU is predicted to be at least
a tetramer. Thus the full-length complex (>160 kDa) is too large for
de novo structure determination by present solution NMR technology.
The protein contains a soluble amino-terminal domain (NTD;
~165 residues) that may be dispensable for channel activity (Fig. 1a).
We screened several constructs of MCU with a deleted NTD, and
found that the one from C. elegans (cMCU-ΔNTD) (Extended Data
Fig. 1) could be expressed to high levels in Escherichia coli. The protein
was extracted using foscholine-14 detergent followed by ion-exchange
and size-exclusion chromatography in physiological buffer at pH 6.5
(see Methods). The purified cMCU-ΔNTD in foscholine-14 formed
pentamers as suggested by size-exclusion chromatography coupled to
multi-angle light scattering (SEC-MALS) and crosslinking (Extended
Data Fig. 2), and generated NMR spectra of good chemical shift
dispersion and resolution (Fig. 1b and Extended Data Fig. 3). Under the
sample condition, cMCU-ΔNTD bound Ru360, and introduction of
the Ser238Ala mutation, which was shown previously to confer
resistance, reduced the inhibitor binding (Fig. 1c).

We first performed negative-stain EM reconstruction of the
cMCU-ΔNTD oligomers, after diluting the NMR sample, using the single particle
analysis method (see Methods), with the goal of obtaining a global
structural framework to aid subsequent structure determination by
NMR. From the 12,860 automatically picked particles, we obtained
a reconstructed 3D density map refined to a resolution of ~18 Å
(Extended Data Fig. 4). The EM map has a roughly cylindrical shape
with five-fold symmetry (Fig. 1d), indicating that cMCU-ΔNTD
forms pentamers. Despite the low resolution, the map showed many
interesting features. One end of the cylinder has a deep hole, probably
corresponding to the TM pore domain, whereas the opposite end is
solid, possibly due to the formation of a coiled-coil (CC) complex by
the predicted CC helix at the carboxy terminus (Fig. 1a,d). Moreover,
each subunit appears to have three levels, and the middle level exhibits
an unusual bulge.

Figure 1 | NMR and EM characterization of cMCU-ΔNTD.
a, Domain organization of MCU (predicted using programs COILS
and TMHMM). b, 1H–15N TROSY heteronuclear single quantum
coherence (HSQC) spectrum of (15N, 2H)-labelled cMCU-ΔNTD oligomer
reconstituted in foscholine-14, recorded at 900 MHz and 23 °C. The peaks
in the inset correspond to tryptophan side-chain amines. c, Isothermal
titration calorimetry (ITC) analysis of Ru360 binding to cMCU-ΔNTD
(left) and the Ser238Ala mutant (right) under the NMR sample condition.
The top and bottom graphs show the raw data (µcal s−1 versus time) and
normalized integration data (kcal mol−1 of injectant versus molar ratio;
molar ratio = Ru360:cMCU-ΔNTD pentamer), respectively. Data fitting
yields dissociation constant (Kd) = 24 ± 8 µM. d, Negative stain EM
reconstruction of the cMCU-ΔNTD oligomers using a protein sample
prepared in the same way as NMR samples. Views of the final 3D volumes
of cMCU-ΔNTD oligomer filtered at 18 Å.

The pentameric complex formed by cMCU-ΔNTD has a total
molecular mass of ~90,375 Da and thus represents a formidable
challenge to NMR spectroscopy. Further challenge came from poor
sample stability at temperatures >23 °C. Consequently, we recorded
NMR data at two different temperatures: 23 °C for collecting the bulk
structural restraints and 33 °C for providing information on protein
regions that showed weak NMR signals at 23 °C. The general approach
we used for structure determination involves: (i) determination of
local structures of the monomers, and (ii) assembly of the oligomer
with intermonomer distance restraints. The secondary structures
of the monomers in the oligomeric complex were determined
mainly using local distance restraints derived from nuclear Overhauser
enhancements (NOEs). We then used a mixed sample with differentially
labelled subunits to measure exclusively NOEs between the
15N-attached protons of one subunit and non-exchangeable protons
of the neighbouring subunits. This experiment provided key
intermonomer NOEs defining the CC, the TM pore, and the ion-selectivity
domains (Extended Data Fig. 5). The initial structural solution then
allowed iterative assignment of additional intermonomer and
long-range NOEs using standard NOE experiments. The structure was
determined using 2,070 local and 150 long-range intramonomer
distance restraints and 220 intermonomer restraints (Extended Data
Fig. 6a and Extended Data Table 1).

The NMR structure of cMCU-ΔNTD shows a well-packed pentamer
with an overall cylindrical shape similar to the EM map
(Extended Data Fig. 6b-e). The EM structure is slightly shorter and
wider than the NMR structure, which could have been caused by specimen
flattening, a phenomenon commonly observed in negative stain
EM. Both have a star-like appearance when viewed from top or bottom,
and feature five bulges in the middle of the assembly that curve in
the same direction (Figs 2a and 1d). The more detailed NMR structure
revealed the architecture of the channel assembly. The inner core of the
pentamer is formed with the second TM helix (TMH2; residues 244-260) and
the coiled-coil helix (CCH; residues 293-316) (Fig. 2b, c). While the
TMH2s pack into a five-helix bundle having a largely polar pore across
the membrane, the CCH outside the membrane forms a CC pentamer
with a hydrophobic core, which may contribute to stabilizing the TM
pore structure. As an independent validation of the pentameric assembly
of the CCH, we showed by chemical crosslinking that a peptide
containing the predicted CC domain (residues 289-316) in water
forms a pentamer in agreement with the oligomeric state of
cMCU-ΔNTD (Methods; Extended Data Fig. 7). The two core domains are
structurally supported by peripheral helices: the TM pore domain is
wrapped by the first TM helix (TMH1; residues 215-234) through
contacts between TMH1 and TMH2, and the extramembrane CC
domain is wrapped by the first helix of cMCU-ΔNTD (H1; residues
180-193) (Fig. 2b, c). The two core domains are not continuous as
TMH2 ends at Tyr260, which is immediately followed by the inner
juxtamembrane helix (IJMH; residues 262-271) that orients at a wide
angle relative to TMH2. The IJMH turns into an unstructured loop
(L2; residues 272-292) before the beginning of the CCH. NMR peaks
for many of the residues in L2 were not observed, possibly owing to
solvent exchange and conformational heterogeneity. The two core
domains appear to be held together on the periphery by the outer
juxtamembrane helix (OJMH; residues 195-213), which contains a
kink around Glu204 and interacts with the IJMH. Despite the apparent
structure, NMR signals for IJMH and OJMH are mostly weak due
to exchange broadening, suggesting the membrane proximal region
of cMCU-ΔNTD is intrinsically unstable. In addition to L2, another
unstructured region is the loop preceding H1 (L1; residues 166-179);
it is unstructured due to the absence of NTD as residues 166-171 in
the corresponding NTD of human MCU form a short helix.

The structure suggests a Ca2+ flow pathway through the MCU
pore. The critical DXXE motif connecting TMH1 and TMH2 forms a
pentameric barrel that appears to be the mouth of the pore (Fig. 3a).
Inside the barrel, both acidic residues are in position to form two
carboxylate rings. The Asp240 ring is solvent exposed, and the Glu243
ring is located deeper, guarding the entrance of the TMH2 pore
commencing at Pro244. The precise ring sizes cannot be measured in our
structure because the Asp240 and Glu243 side chains could not be
precisely defined using the available NOE restraints (Fig. 3b), but
the diameter of the ring defined by Cβ is 7 Å and 11 Å for Asp240
and Glu243, respectively. These values are smaller on average than
those of the asparagine ring in the CorA Mg2+ channel (11 Å) or
the glutamate ring in the Orai Ca2+ channel (12 Å), suggesting that
Ca2+ should be partially dehydrated by the Asp240 and Glu243 rings.
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Figure 2 | Structure of the cCMCU-ANTD pentamer. a, Ribbon 
representation of the cCMCU-ANTD structure displaying three 

distinct layers that correspond to the transmembrane (TM; orange), 
juxtamembrane (JM; cyan), and extramembrane domain (EMD; blue) 
regions, respectively. b, Cartoon representation of the cCMCU-ANTD 
pentamer showing the formation of the uniporter core, which consists 
of the TM pore formed by TMH2 (yellow) and the coiled-coil pentamer 


Passing the acidic rings, Ca’* can readily move through the TM pore, 
which is hydrophilic and lined mostly with threonines (Fig. 3c and 
Extended Data Fig. 8). The TM pore ends with Tyr260. Upon exiting 
the TM pore from the C-terminal end, Ca’* will be in a very hydro- 
philic chamber enclosed by IJMH and OJMH on the side and the CC 
domain on the bottom (Fig. 2b, c). Ca’* cannot exit through the CC 
domain because this domain has a solid hydrophobic core. Surface 
plot also shows that interaction between IJMH and OJMH seals the 
side of the chamber leaving no visible holes for ions to exit laterally 
(Fig. 3d). Thus, the structure represents the closed conformation of the 
channel, which is consistent with the fact that metazoan MCU in the 
absence of EMRE does not transport Ca”* (refs 7, 11). We speculate 
that conformational rearrangements induced by EMRE binding are 
needed to allow Ca** to exit. By nature of being unstable, the IJMH, 
OJMH and L2 segments of each MCU monomer are the likely regions 
to undergo conformational changes that would create lateral exit paths 
near the membrane, each activated by its own EMRE peptide. 

The structure of CMCU-ANTD provides a framework for better 
understanding human MCU (HsMCU), especially because H3MCU 


formed by CCH (marine). The structure is placed in the presumed 
membrane such that the peripheral hydrophobic residues in the TM 
domain are lipid facing. c, Cartoon representation of two subunits of the 
cMCU-ANTD pentamer showing the folding of individual subunits. The 
helical segments are defined in the text. The dashed lines labelled as 

L1 and L2 correspond to the unstructured regions of the protein. 


and cMCU are orthologous, with greater than 40% sequence identity,
sharing the same protein domain organization. We previously
reported the use of HEK-293T cells in which we knocked out HsMCU.
Such cells do not exhibit mitochondrial Ca²⁺ uptake, and represent an
excellent system into which we can introduce mutant alleles to
evaluate their effect on mitochondrial Ca²⁺ uptake. We created the NTD
deletion in HsMCU (HsMCU-ΔNTD), corresponding to the deletion in
cMCU-ΔNTD, and found that the mutant is able to complement the
need for HsMCU, demonstrating that the NTD in the human
protein is dispensable for calcium uptake activity (Extended Data Fig. 9).
We then created a cysteine-free mutant of HsMCU,
akin to cMCU-ΔNTD, and found that it could rescue Ca²⁺
transport in human cells lacking MCU (Fig. 4a). We next proceeded to
introduce mutations into cysteine-free HsMCU to validate our NMR
structure of cMCU-ΔNTD. Glu257 (Glu236 in cMCU) is highly conserved
across eukaryotic MCU orthologues, but it is not conserved in MCUb,
a non-conducting human paralogue of MCU. It was proposed that this
residue is key to Ca²⁺ conductance because transient expression of
the Glu257Ala mutant on a partial MCU knockdown background failed
to rescue calcium transport fully. However, our structure (Fig. 3a, c)
places this residue outside the entrance of the pore, and predicts it to
be dispensable for Ca²⁺ uptake. Indeed, consistent with this
prediction, we find that when Glu257Ala or Glu257Ser mutants are stably
expressed at levels comparable to that of wild-type protein on a clean
MCU knockout background, they conduct Ca²⁺ (Fig. 4b). In contrast
to Glu257, substitutions of Asp or Glu within the DXXE motif with Ala
disrupted mitochondrial Ca²⁺ uptake. Our structure shows that the
side chains of these residues provide two carboxylate rings that might
be involved in Ca²⁺ permeation and/or selectivity. Given how critical
these two residues appear to be in the structure, we made very
conservative mutations, namely Asp261Glu and Glu264Asp (Asp240 and
Glu243 in cMCU). While Asp261Glu is partially functional, we note
that Glu264Asp has completely lost its ability to conduct Ca²⁺ despite
expressing properly (Fig. 4c), underscoring the critical importance of
this residue for permeation.

We next considered the mechanism of action of the drug Ru360,
a well-known inhibitor of the uniporter. Ru360 inhibits the human
uniporter with nanomolar potency in isolated mitochondria or in
permeabilized cells. We previously identified Ser259 (Ser238 in cMCU)
as being potentially important for the mechanism of action of Ru360,
since the Ser259Ala mutant permits Ca²⁺ conductance but confers
nearly complete resistance against Ru360 (refs 4, 10). In our structure
of cMCU-ΔNTD, this serine residue lies at the apex of the pore, before
the mini-barrel, raising the hypothesis that Ru360 operates by simply
obstructing the pore. If this hypothesis is correct, then the
introduction of bulky residues into this position ought to occlude the pore.
In fact, Ser259Arg is well expressed and does not permit any Ca²⁺ to
enter, probably mimicking the effect of Ru360 (Fig. 4d). Collectively,
these experiments demonstrate the value of our structure for
understanding the function of human MCU.

Figure 3 | Architecture of the pore and ion selectivity filter. a, Cartoon
representation of the TM domain displaying the mini barrel at the mouth
of the TM pore that contains the DXXE Ca²⁺ selectivity elements.
b, Enlarged view of the DXXE-containing region showing the side-chain
conformational diversity of Asp240 and Glu243. c, TM domains of
subunits 1 and 3 showing the pore-lining residues. d, The pore surface
calculated using the program HOLE. The region of the channel coloured
in green is only wide enough to allow passage of one water molecule,
whereas the blue portion can accommodate two or more water molecules.
The red region is too narrow to allow any water to pass through.

Figure 4 | Functional mutagenesis of HsMCU inspired by the
cMCU-ΔNTD structure. a, Cysteine-free MCU rescues mitochondrial
calcium uptake to the same extent as wild-type MCU in MCU
knockout (KO) cells. b, Mutation of MCU Glu257 to Ala or Ser does not
impair its function. c, Mutation of Asp261 to Glu permits ion permeation
through MCU whereas Glu264 to Asp impairs function. d, Mutation of
Ser259 to Arg impairs MCU function. a-d, Representative traces of Ca²⁺
uptake in digitonin-permeabilized cells after addition of 50 µM CaCl₂ are
shown on left, plotted as extramitochondrial [Ca²⁺] (a.u.). The bar graph
shows the rate of Ca²⁺ uptake relative to
wild-type HEK-293T cells (mean ± s.d., n = 4). Cell lysates were analysed
by immunoblotting using anti-Flag antibody to detect expression of MCU
protein. ATP5A was used as loading control.

The cMCU-ΔNTD structure represents a new solution of ion
channel architecture. The bacterial and mitochondrial Mg²⁺ channel,
CorA/MRS2, also forms a pentamer and has a similar domain
organization based on primary and secondary structures, but it has a very
different tertiary and quaternary fold. The TM domains of both
CorA and MCU are composed of ten helices (two from each subunit)
arranged in two concentric layers. Whereas the CorA pore is formed
by TMH1, the MCU pore is formed by TMH2. The selectivity filter in
MCU appears to be composed of two carboxylate rings: the
solvent-accessible aspartate and the internal glutamate rings (Fig. 3a, b). In
CorA, the solvent-accessible ring is made of five asparagines, and the
slightly larger internal ring of backbone carbonyls of glycine residues.
By contrast, the Ca²⁺ release-activated Ca²⁺ channel Orai has only one
ring of six glutamate side chains in the filter region. Another major
difference between CorA and Orai on the one hand and MCU on the
other is that after the selectivity filter, the pore in both CorA and Orai
becomes hydrophobic for several helical turns. In MCU, however, the
pore remains hydrophilic, suggesting an explanation for fast ion
conduction by this channel. Finally, in Orai and CorA, the ion pathway
continues far beyond the membrane bilayer, but in MCU the pore
ends after traversing the bilayer, with ions likely exiting laterally near
the membrane. In this respect the architecture of cMCU-ΔNTD is
reminiscent of some channel proteins that belong to the diverse
family of pentameric ligand-gated ion channels (pLGIC), which includes
serotonin 5-HT₃ and nicotinic acetylcholine receptors.

In conclusion, the combined use of NMR and EM enabled the
characterization of the overall architecture of the MCU, which is also one
of the largest membrane protein complexes studied by NMR. The
structure represents a new pore architecture for conducting Ca²⁺, as
its ion-selectivity filter, ion-conducting pore, and the arrangement of
the extramembrane domains are all different from the known Ca²⁺
channel structures. Although the reported structure does not show
Ca²⁺ exit, which is consistent with being non-conducting in the
absence of EMRE, the overall architecture suggests lateral Ca²⁺ exit
near the middle of the channel complex. Several outstanding questions
remain. The structure shows the selectivity filter and the pore, yet the
detailed mechanism of Ca²⁺ binding, transport, and exit is unknown,
including the function of the L2 loop that could block the exit, but is
unstructured in our model. Although we have provided compelling
evidence that cMCU forms a pentamer in vitro, the oligomeric state in
vivo remains to be confirmed. Finally, MCU interactions with its TM
partner EMRE and its regulators MICU1/2 are important subjects of
future investigation.


Online Content Methods, along with any additional Extended Data display items and
Source Data, are available in the online version of the paper; references unique to
these sections appear only in the online paper.

METHODS


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Protein sample preparation. E. coli codon-optimized DNA encoding residues
167-316 of the C. elegans MCU (cMCU-ΔNTD) with a C-terminal 6×His tag was
synthesized (GenScript) and cloned into the pET21a vector (Extended Data Fig. 1).
BL21(DE3) cells were transformed with the vector for expression. The cells were
grown in either LB broth or isotopically labelled M9 minimal media at 37 °C until
absorbance at 600 nm (A600 nm) reached 0.6-0.7. After induction with 0.2 mM
isopropyl-β-D-thiogalactopyranoside (IPTG), protein was expressed for 20 h at 18 °C.
Cells were collected by centrifugation and resuspended and lysed by sonication in
buffer A (20 mM HEPES, pH 7.4, 150 mM NaCl, 10 µg ml⁻¹ lysozyme, 10 µg ml⁻¹
DNase, 1 mM PMSF and 2 mM EDTA). After lysis, the inclusion bodies and
membranes were collected by centrifugation at 39,000g for 1 h and resuspended in buffer
B (20 mM HEPES, pH 7.4, 150 mM NaCl, 40 mM foscholine-14 and 2 mM EDTA),
followed by stirring at 4 °C overnight. Insoluble aggregates were removed by
centrifugation at 39,000g for 40 min. The supernatant was then subjected to Ni-NTA
purification in buffer B with the foscholine-14 concentration adjusted to 0.48 mM
(4× detergent CMC). The protein was eluted using 400 mM imidazole in the same
buffer and dialysed against buffer C (25 mM CHES, pH 9.2, 50 mM NaCl and
2 mM EDTA) to remove imidazole. After dialysis, the sample was subjected to
ion-exchange purification using the HiTrap Q HP (1 ml) column (GE Healthcare)
in buffer C with added 0.48 mM foscholine-14. The protein was eluted using a NaCl
gradient (0.05-1.00 M in 20 ml). Finally, the protein was further separated by size
exclusion using the Superdex 200 10/300 GL column (GE Healthcare) in buffer
D (20 mM MES, pH 6.4, 75 mM NaCl, 0.48 mM foscholine-14, 0.3 mM NaN₃, and
2 mM EDTA). The elution peak fractions were analysed by SDS-PAGE (Extended
Data Fig. 2) and pooled and concentrated to achieve 0.8 mM cMCU-ΔNTD
(monomer) for NMR measurements. The final NMR sample buffer contains 20 mM
MES, pH 6.4, 75 mM NaCl, ~27 mM foscholine-14, 0.3 mM NaN₃, 2 mM EDTA
and 5% D₂O. Typical final yields of cMCU-ΔNTD were 2-3 mg of protein from
1 l of cell culture.

SEC-MALS analysis of the cMCU-ΔNTD molecular mass. The instrument setup
used for the SEC-MALS experiment consisted of an Agilent 1260 Infinity Isocratic
Liquid Chromatography System connected in series with a Wyatt Dawn Heleos II
Multi-Angle Light Scattering (MALS) detector (Wyatt Technology) and a Wyatt
Optilab T-rEX Refractive Index Detector (Wyatt Technology). Analytical
size-exclusion chromatography was performed at room temperature using a Superdex
200 10/300 GL column (GE Healthcare) equilibrated with a mobile phase
containing 20 mM MES, pH 6.4, 75 mM NaCl, 0.48 mM foscholine-14, 0.3 mM NaN₃ and
2 mM EDTA. A 100 µl sample of purified cMCU-ΔNTD at 4.0 mg ml⁻¹ was injected
into the column and eluted at a flow rate of 0.4 ml min⁻¹. The column effluent
was monitored in-line with three detectors that simultaneously monitored UV
absorption, light scattering and refractive index. The data from the three detectors
were imported by the ASTRA software package, and the three-detector method
was used to determine the molecular mass.

Characterization of the cMCU-ΔNTD oligomeric state by crosslinking. The
oligomeric state of cMCU-ΔNTD in the NMR sample was examined by chemical
crosslinking using DTSSP (3,3′-dithiobis(sulfosuccinimidyl propionate); Thermo
Scientific). The cMCU-ΔNTD sample was first dialysed overnight against the
crosslinking reaction buffer (25 mM sodium phosphate, pH 7.5, and 50 mM NaCl).
A stock solution of 50 mM DTSSP in the reaction buffer was prepared for addition
to the protein solution. Five samples each containing 0.1 mM cMCU-ΔNTD in
30 µl reaction buffer were mixed with 0, 5, 7, 10 and 15 mM DTSSP, respectively, at
room temperature. After 1 h, the reactions were quenched by adding 2 µl of the stop
solution (1 M Tris, pH 7.5). The reaction mixtures were analysed using SDS-PAGE,
and visualized using silver stain.

ITC analysis of Ru360 binding. ITC was used to investigate whether
cMCU-ΔNTD can interact with the known inhibitor Ru360 under the NMR sample
condition. The protein samples used for ITC consisted of 20 mM MES, pH 6.4, 75 mM
sodium chloride, 4 mM foscholine-14 and 76 µM cMCU-ΔNTD (monomer). The
inhibitor solution contained 200 µM Ru360 in 20 mM MES, pH 6.4, 75 mM sodium
chloride and 4 mM foscholine-14. The titration protocol involved injecting 0.5 µl
inhibitor solution into 202 µl protein solution for the first point and 1 µl for each of
the following 34 points (Extended Data Fig. 1c). Experiments were performed at
25 °C using a Microcal ITC200 (GE Healthcare). The same ITC experiment was also
performed for the cMCU-ΔNTD mutant with the S238A mutation.

Negative stain EM analysis. For EM analysis, the protein sample obtained from
the size-exclusion chromatography above was subjected to two more rounds of
size exclusion to achieve even higher homogeneity. The middle 20% of the elution
peak was collected and subjected to another round of size-exclusion purification
under the same condition. The middle 20% of the elution from the second size
exclusion run was collected and subjected to a third round of size exclusion. Finally,
the middle 20% of the third elution was collected as the final sample for EM study.

To prepare the sample for collecting negative stain EM images, 5 µl of the
cMCU-ΔNTD sample from the third size exclusion run (10 µg ml⁻¹ protein in 20 mM
MES, pH 6.5, 75 mM NaCl, 0.48 mM foscholine-14, 1× protease inhibitor cocktail
(Thermo) and 0.1 mM NaN₃) was applied to a glow-discharged 400 mesh
continuous carbon grid (Beijing Zhongjingkeyi Technology). The sample was then
negatively stained with 0.75% (wt/vol) uranyl formate solution, and spontaneously
dried at room temperature. The data were recorded on a Tecnai G2 Spirit BioTWIN
transmission electron microscope (FEI) operated at 120 kV and equipped with
an Eagle 4K × 4K CCD camera. We recorded 514 images at 110,000× microscope
magnification with a pixel size of 1.05 Å per pixel. The defocus ranged from 1.0 to
1.5 µm. For initial model building, Random Conical Tilt (RCT) image pairs were
manually taken at tilt angles of 45° and 0°, respectively.

We boxed out 12,860 particles by using the e2boxer.py program in EMAN2.1
(ref. 29) (Extended Data Fig. 4a). Contrast transfer function parameters were
determined for particles boxed out from each CCD image using the EMAN1.9
(ref. 30) procedure ctfit, followed by phase flipping using the applyctf program.
The data were then low-pass filtered to 10 Å to enhance the image contrast for
three-dimensional (3D) reconstruction. Reference-free 2D analysis used the
EMAN1.9 program refine2d.py, which revealed the existence of five-fold
symmetry in the cMCU-ΔNTD sample (Extended Data Fig. 4b). The initial model
was generated from RCT data using the e2rct.py program in EMAN2.1. This model
was further refined with the 12,860 untilted particles by using the refine
program and calling the FRM2D image alignment kernel in EMAN1.9. Initially,
no symmetry was imposed in the 3D reconstruction process (Extended Data
Fig. 4c), and subsequently the five-fold symmetry revealed by reference-free 2D
analysis was imposed in the 3D reconstruction process. The final resolution was
estimated at 18 Å by the 0.5 FSC criterion using the eotest program in EMAN1.9
(Extended Data Fig. 4d).
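
The 0.5 FSC criterion mentioned above can be illustrated with a short sketch. This is not the eotest implementation; it is a minimal example that assumes the Fourier shell correlation is available as a curve sampled at increasing spatial frequencies (in Å⁻¹) and reports the resolution at which the curve first falls below the threshold, using linear interpolation between neighbouring shells. The function name and the sample curve are hypothetical.

```python
def resolution_at_threshold(spatial_freqs, fsc_values, threshold=0.5):
    """Return the resolution (in Å) where the FSC first drops below `threshold`.

    spatial_freqs: increasing spatial frequencies in 1/Å, one per shell.
    fsc_values:    FSC value for each shell.
    Returns None if the curve never crosses the threshold.
    """
    shells = list(zip(spatial_freqs, fsc_values))
    for (f0, c0), (f1, c1) in zip(shells, shells[1:]):
        if c0 >= threshold > c1:
            # Linear interpolation between the two shells that bracket the crossing.
            f_cross = f0 + (c0 - threshold) * (f1 - f0) / (c0 - c1)
            return 1.0 / f_cross
    return None


# Made-up FSC curve for illustration only (not data from this study).
freqs = [0.02, 0.04, 0.06, 0.08]
fsc = [0.9, 0.7, 0.3, 0.1]
print(resolution_at_threshold(freqs, fsc))
```

With this illustrative curve the crossing falls midway between the second and third shells, so the reported resolution is the reciprocal of that interpolated frequency.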

Assignment of NMR resonances. All NMR experiments were conducted at
either 23 °C or 33 °C on Bruker spectrometers equipped with cryogenic probes.
NMR spectra were processed using NMRPipe and analysed using ccpNMR
and Xeasy. Sequence-specific assignment of backbone ¹HN, ¹⁵N, ¹³Cα, ¹³Cβ and
¹³C′ chemical shifts was achieved using the TROSY versions of standard triple
resonance experiments including HNCA, HN(CO)CA, HNCACB, HN(CA)CO
and HNCO. In addition, a 3D HSQC-NOESY-TROSY experiment with
¹⁵N, ¹⁵N and ¹HN evolution in the t1, t2 and t3 dimensions, respectively, was
recorded with an NOE mixing time (τNOE) of 200 ms. These experiments were
performed using multiple (¹⁵N, ¹³C, ²H)-labelled protein samples on a 600 MHz
spectrometer at 33 °C. Multiple samples were used because of the poor stability of the
protein at temperatures >23 °C; that is, the sample began to show precipitation
after ~7 days at 33 °C. Despite this problem, the higher temperature was used
to obtain more favourable T₂ for triple resonance experiments. By combining
the triple resonance data with the use of NOESY, we were able to confidently
assign 91% of non-proline residues, although only 77% could be assigned using
only the triple resonance spectra. Protein aliphatic and aromatic resonances were
assigned using a combination of 2D ¹³C-edited HSQC, 3D ¹⁵N-edited NOESY-TROSY
(τNOE = 100 ms) and ¹³C-edited NOESY-HSQC (τNOE = 150 ms) recorded
on a 900 MHz spectrometer at both 23 °C and 33 °C. These experiments were
performed using multiple (¹⁵N, ¹³C)-labelled protein samples in which foscholine-14
was deuterated (Anatrace). The data sets recorded at two different temperatures
provided complementary information. On average, the spectra at 23 °C show
stronger peaks for the extramembrane regions but very weak peaks for the TM
pore domain, especially the selectivity segment, possibly due to higher rigidity
of the TM domain. At 33 °C, however, the TM resonances that were too weak to
analyse in the 23 °C spectra became sufficiently intense.

Assignment of NOEs. Short-range NOEs used for defining local and secondary
structures were assigned using the ¹⁵N-edited NOESY-TROSY and ¹³C-edited
NOESY-HSQC recorded at two different temperatures as described above.
Intermonomer NOEs between the backbone amide protons of a monomer and
the non-exchangeable aliphatic or aromatic protons of its neighbouring monomers
were assigned using a sample that was reconstituted with an approximately 1:1 mixture
of (¹⁵N, ²H)-labelled cMCU-ΔNTD and (15% ¹³C)-labelled cMCU-ΔNTD. The
non-deuterated subunit was (15% ¹³C)-labelled for recording the ¹H-¹³C HSQC
spectrum (optimized for methyl groups) as an internal aliphatic proton chemical shift
reference while providing stereospecific assignment of leucine and valine methyl
groups. Mixing was done at the cell level before cell lysis; that is, the amounts
of differently labelled cells were adjusted based on the protein expression level
to ensure an approximately 1:1 ratio of (¹⁵N, ²H)- and (15% ¹³C)-labelled
cMCU-ΔNTD. In this sample, foscholine-14 was also deuterated. Recording a ¹⁵N-edited
NOESY-TROSY using this sample allowed exclusive detection of NOE cross
peaks between the ¹⁵N-attached protons of one monomer and aliphatic protons
of adjacent monomers. The 3D ¹⁵N-edited NOESY-TROSY (τNOE = 300 ms) was
recorded using this type of sample at both 23 °C and 33 °C. The intermonomer
NOEs between the neighbouring CCHs and TMH2s (see main text) effectively
defined the core of the cMCU-ΔNTD complex. The initial structural solution
of the core assembly then allowed us to assign iteratively additional intra- and
intermonomer long-range NOEs between the aliphatic and aromatic protons in
¹⁵N-edited NOESY-TROSY and ¹³C-edited NOESY-HSQC recorded at both 23 °C
and 33 °C. These long-range NOEs subsequently defined packing of TMH1 against
TMH2, packing of H1 against CCH, as well as interaction between IJMH and
OJMH.

© 2016 Macmillan Publishers Limited. All rights reserved

Structure calculation. Structures were calculated using the program
XPLOR-NIH. The local and secondary structures of the monomer were first defined
using short-range NOE restraints and backbone dihedral restraints derived from
chemical shifts (using TALOS+). A total of 10 monomer structures were
calculated using a standard simulated annealing protocol. Five copies of the
lowest-energy monomer structure were used to construct an initial model of the
pentamer using the intermonomer NOE restraints assigned for the pore-forming
TMH2 and CCH helices. For each intermonomer restraint between two adjacent
monomers, five identical distance restraints were assigned respectively to all pairs
of neighbouring monomers to satisfy the condition of C5 rotational symmetry (as
indicated by the EM reconstruction). The assembled pentamer was used as a
starting model to guide assignment of more long-range NOEs in the ¹⁵N-edited and
¹³C-edited NOE spectra recorded at 23 °C and 33 °C. This process was repeated
iteratively until the calculated models converged to a backbone r.m.s.d. of <1 Å
(not including the disordered regions). In each of the iterations, the starting
model was refined against dihedral restraints and local and long-range intra- and
intermonomer NOE restraints using a simulated annealing protocol in which the
bath was cooled from 1,000 to 100 K. The NOE restraints were enforced by flat-well
harmonic potentials, with the force constant ramped from 25 to 50 kcal mol⁻¹ Å⁻²
during annealing. For the defined helical regions, backbone dihedral angle
restraints (φ = −60°, ψ = −40°) were applied, all with a flat-well (±10°)
harmonic potential with force constant ramped from 15 to 30 kcal mol⁻¹ rad⁻².
In the final round of refinement, a total of 150 structures were calculated and
15 low-energy structures were selected as the structural ensemble. Ramachandran
plot statistics for the structure ensemble, calculated using PROCHECK, are as
follows: most favoured (86.3%), additionally allowed (11.0%), generously allowed
(1.8%) and disallowed (1.0%). Restraint and refinement statistics are shown in
Extended Data Table 1.
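
As an illustration of the restraint terms described above, the following sketch shows a generic flat-well harmonic potential and a ramped force constant. It is not the XPLOR-NIH implementation: the purely quadratic penalty outside the well, the linear ramp, and the distance bounds in the usage lines are assumptions made for the example, and all function names are hypothetical.

```python
import math


def flat_well_energy(value, lower, upper, force_constant):
    """Zero inside [lower, upper]; quadratic penalty on the violation outside."""
    if value < lower:
        violation = lower - value
    elif value > upper:
        violation = value - upper
    else:
        violation = 0.0
    return force_constant * violation ** 2


def ramped_constant(start, end, step, n_steps):
    """Force constant at `step` of an assumed linear ramp over `n_steps` steps."""
    if n_steps < 2:
        return end
    return start + (end - start) * step / (n_steps - 1)


def helical_dihedral_energy(phi_deg, psi_deg, force_constant, half_width_deg=10.0):
    """Flat-well penalty around phi = -60 deg, psi = -40 deg (energy per rad^2)."""
    half_width = math.radians(half_width_deg)
    total = 0.0
    for angle, target in ((phi_deg, -60.0), (psi_deg, -40.0)):
        # Wrap the deviation into [-180, 180) degrees before converting to radians.
        deviation = (angle - target + 180.0) % 360.0 - 180.0
        total += flat_well_energy(math.radians(deviation), -half_width,
                                  half_width, force_constant)
    return total


# Hypothetical NOE distance restraint with bounds 1.8-5.0 Å, at the end of the ramp.
k_noe = ramped_constant(25.0, 50.0, step=10, n_steps=11)   # kcal mol^-1 Å^-2
print(flat_well_energy(5.5, 1.8, 5.0, k_noe))               # 0.5 Å violation
print(helical_dihedral_energy(-75.0, -40.0, 30.0))          # phi 5° outside the well
```

Inside the well the restraint contributes no energy, so only distances or angles that fall outside the allowed range pull on the structure, and ramping the force constant makes that pull stiffer as annealing proceeds.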

Characterization of the oligomeric state of the CCH peptide by crosslinking.
To examine whether the C-terminal CCH domain is able to oligomerize by itself
in water, we investigated the oligomeric state of a CCH-containing peptide
corresponding to residues 288-316 of cMCU (plus the C-terminal L and E as in the
cMCU-ΔNTD construct) by chemical crosslinking using DTSSP
(3,3′-dithiobis(sulfosuccinimidyl propionate); Thermo Scientific). Approximately 0.4 mg
synthesized peptide (Pepmic Co., Ltd) was dissolved in 1 ml of reaction buffer (25 mM
sodium phosphate, pH 7.5, and 50 mM NaCl) to a concentration of 0.1 mM.
A stock solution of 30 mM DTSSP in water was prepared for addition to the peptide
solution. Four samples, each containing 0.1 mM peptide in 30 µl reaction buffer,
were incubated with 0, 1, 3 and 5 mM DTSSP at room temperature. After 1 h, the
reactions were quenched by adding 1 µl of stop solution (1 M Tris, pH 7.5). The
reaction mixtures were analysed using SDS-PAGE. The SDS-PAGE results show
that at DTSSP concentrations of 3 and 5 mM, the peptides could be crosslinked
up to pentamers (Extended Data Fig. 7), indicating that the synthesized peptide
can form pentamers in water.

Functional mutagenesis of HsMCU. Material. Reagents were obtained from the
following sources: anti-Flag M2 affinity gel from Sigma (A2220), ATP5A antibody
from Abcam (ab14748), Oregon Green 488 BAPTA-6F from Life Technologies
(O23990).

Cell culture and cell lines. MCU knockout cell lines were generated as previously
described. Cells were infected with lentivirus for stable expression of Flag-tagged
proteins and selected with 100 µg ml⁻¹ puromycin. Cell lines have been tested for
mycoplasma contamination on a monthly basis using the MycoAlert Mycoplasma
Detection Kit (Lonza, LT07-418) and are free of mycoplasma contamination. The
parental HEK-293T cell line has the following STR profile: TH01 (7, 9.3); D21S11
(28, 29, 30.2); D5S818 (7, 8, 9); D13S317 (11, 12, 13, 14, 15); D7S820 (11); D16S539
(9, 13); CSF1PO (11, 12, 13); Amelogenin (X); vWA (16, 18, 19, 20); TPOX (11). This
profile matches 100% to the HEK-293T cell line profile (ATCC, CRL-3216) if the
Alternative Master's algorithm is used, and 83% if the Tanabe algorithm is used.
The MCU knockout cell line was generated from this parental cell line by single cell
cloning after MCU was knocked out using TALEN technology.

Calcium uptake in permeabilized HEK-293T cells. The calcium uptake assay in
permeabilized HEK-293T cells was done as previously described. The rate of calcium
uptake is defined as the slope of the linear portion of the calcium uptake curve
(between 20 and 30 s). In each experiment, calcium uptake rates of samples were
normalized to a randomly selected wild-type HEK-
293T cell uptake rate.
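
The rate definition above can be sketched in a few lines. This is an illustrative calculation rather than the analysis code used in the study: it assumes the trace is a list of time points (s) and signal values, fits an ordinary least-squares line to the 20-30 s window, and divides the sample slope by the wild-type slope. Function names and the example traces are hypothetical.

```python
def window_slope(times, signal, t_start=20.0, t_end=30.0):
    """Least-squares slope of `signal` versus `times` within [t_start, t_end]."""
    points = [(t, y) for t, y in zip(times, signal) if t_start <= t <= t_end]
    if len(points) < 2:
        raise ValueError("need at least two points inside the fitting window")
    n = len(points)
    mean_t = sum(t for t, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    s_tt = sum((t - mean_t) ** 2 for t, _ in points)
    s_ty = sum((t - mean_t) * (y - mean_y) for t, y in points)
    return s_ty / s_tt


def relative_uptake_rate(times, sample_signal, wild_type_signal):
    """Sample uptake rate expressed relative to the wild-type rate."""
    return window_slope(times, sample_signal) / window_slope(times, wild_type_signal)


# Made-up traces: extramitochondrial signal falling as mitochondria take up calcium.
times = list(range(0, 41))
wild_type = [100.0 - 2.0 * t for t in times]
mutant = [100.0 - 1.0 * t for t in times]
print(relative_uptake_rate(times, mutant, wild_type))
```

Because both slopes carry the same sign, the ratio is positive whether the trace is read as a falling extramitochondrial signal or as a rising uptake curve.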

HsMCU-ANTD cloning and expression. For expression and mitochondrial target- 
ing of HHMCU-ANTD, cDNA that corresponds to H3MCU amino acids 187-351 
was fused to HSMCU mitochondrial targeting signal (amino acids 1-56) and 
cloned into pLYS1 vector as described previously’. 
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hMCU-FL 1 SSRGGGGGGAGGC Cc H QRI 60 
cMCU-FL al --MRNGRCLVTPE--------------- L I TTASSSTS- 42 
hMCU-FL 61 LGAVYC QO: 120 
cMCU-FL 430 =H-S5 IQE P Ss 96 
hMCU-FL 121 177 
cMCU-FL 97 156 
hMCU-FL 178 237 
cMCU-FL 157 216 
cMCU-ANTD 167 216 
hMCU-FL 238 297 
cMCU-FL 217 276 
cMCU-ANTD 217 276 
hMCU-FL 298 LF VHLPLROQIGEKD--- 351 
cMCU-FL 277 KQ E FQHLPVSYLSNLEAEK 333 
cMCU-ANTD 277 KQ E HHHHHH 324 
Extended Data Figure 1 | Multiple sequence alignment of the full- in the DXXE motif are indicated by downward arrow. The serine residue 
length hMCU, full-length cMCU and cMCU-ANTD. Residues that are shown to be involved in Ru360 inhibition is indicated by an asterisk’ . 
invariant in all three sequences are shaded in red. Partially conserved and Helical segments, as determined by NMR in this study, are indicated by 
much less conserved residues are shaded in blue and grey, respectively. The cylinders and labelled as in the main text. The accession numbers for 
mutations introduced in cCMCU-ANTD are shaded in yellow. D and E hMCU and cMCU are NM_138357.1 and NP_500892.1, respectively. 
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Extended Data Figure 2 | Biochemical analysis of CMCU-ANTD 
oligomeric state. a, Elution peak of CMCU-ANTD from Superdex 

200 10/300 GL column in 20 mM MES, pH 6.4, 75 mM NaCl, 0.48 mM 
foscholine-14, 0.3 mM NaN; and 2mM EDTA. b, SDS-PAGE analysis 

of the elution peak showing sample purity >95%. c, SEC-MALS analysis 
of the diluted NMR sample of CMCU-ANTD. The SEC-MALS/UV/ 
refractive index measurement was used to determine CMCU-ANTD 
molecular mass based on the three-detector method”®. In this method, 
since the foscholine-14 detergent does not have UV absorption at 280 nm, 
the protein mass is directly calculated without correcting for the bound 
micelle. Chromatograms show the readings from the light scattering at 90° 
(red), refractive index (blue), and UV (green) detectors. The left and right 
axes represent the light scattering detector reading and molecular mass, 
respectively. The black curve represents the calculated molecular mass, 
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and the average mass of the elution peak of cCMCU-ANTD is 

92 kDa. Note the ~10 ml difference in elution volumes between a and 

c is due to the volume of solution feeding from SEC to MALS. d, SDS- 
PAGE analysis of chemical crosslinking of the diluted cCMCU-ANTD 
NMR sample. The reaction mixture contains 0.1 mM cMCU-ANTD 
(monomer), 3 mM foscholine-14, and various amounts of DTSSP. The 
reactions were quenched after 1h by the addition of 21] of 1 M Tris, 

pH 7.5. The quenched samples were loaded to 12% Bis-Tris gel (Novex Life 
Technologies). The gel was silver stained using the standard protocol. The 
five lanes to the right of the molecular mass marker correspond to DTSSP 
concentrations of 0, 5, 7, 10 and 15 mM. The band that corresponds to a 
pentamer showed the most obvious increase in intensity as a function of 
[DTSSP]. 
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Extended Data Figure 3 | The cCMCU-ANTD methy] group resonances with residue specific assignment. The 'H-'*C HSQC was recorded with 28 ms 
constant-time '°C evolution at 900 MHz. 
Extended Data Figure 4 | Single particle EM of the cMCU-ΔNTD
oligomeric complex. a, Typical image of the cMCU-ΔNTD oligomers
negatively stained with uranyl formate. The bar corresponds to 50 nm
in length. A selected subset of cMCU-ΔNTD particles is highlighted
with white circles. Shown in the top left corner are a typical top/bottom
view and a typical side view of the selected particles. b, Gallery of 100 out
of 202 reference-free 2D class averages of the particles, which revealed
the existence of five-fold symmetry in the complex (pentagon shapes
marked by yellow circles). c, Comparison of different views of the 3D EM
density map reconstructed without enforcing C5 symmetry (top) with the
corresponding views of the map reconstructed with symmetry (bottom).
The largest discrepancies are in the middle bulge region between the TM
and CC domains, possibly due to the lack of rigid structure in the large L2
loop. d, Comparison of the 2D projections (top) from the 3D EM density
map with the corresponding reference-free 2D classes. e, Estimation of
resolution of the final 3D reconstruction. The Fourier shell correlation
(FSC) suggests a resolution of ~18 Å using the 0.5 criterion.
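
The 0.5 criterion mentioned for the FSC curve can be illustrated with a short sketch. It assumes an FSC curve sampled per resolution shell and reports the reciprocal of the spatial frequency at which the curve first drops below the threshold, using linear interpolation between shells. The function name and the sample curve are hypothetical and are not data from this study.

```python
def fsc_resolution(spatial_freqs, fsc_values, threshold=0.5):
    """Return the resolution (1 / spatial frequency) at which the FSC curve
    first falls below `threshold`, or None if it never does.

    spatial_freqs: shell frequencies in 1/Å, ascending (assumed input layout).
    fsc_values:    FSC value for each shell.
    """
    for i in range(1, len(fsc_values)):
        prev_f, prev_c = spatial_freqs[i - 1], fsc_values[i - 1]
        cur_f, cur_c = spatial_freqs[i], fsc_values[i]
        if prev_c >= threshold > cur_c:
            # Linear interpolation between the two shells that bracket the threshold.
            frac = (prev_c - threshold) / (prev_c - cur_c)
            crossing = prev_f + frac * (cur_f - prev_f)
            return 1.0 / crossing
    return None


# Hypothetical FSC curve, for illustration only.
freqs = [0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.07]
fsc = [0.99, 0.97, 0.92, 0.80, 0.62, 0.44, 0.20]
resolution = fsc_resolution(freqs, fsc)
print(f"Estimated resolution: {resolution:.1f} Å")
```

With this made-up curve the crossing falls between the 0.05 and 0.06 1/Å shells, which corresponds to a resolution in the high-teens of ångströms, the same range as the value quoted in the caption.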
Extended Data Figure 5 | Intermonomer NOEs from mixed isotope
labelled sample. Examples are taken from the 3D ¹⁵N-edited NOESY-TROSY
of the mixed labelled sample containing a 1:1 mixture of (¹⁵N, ²H)-labelled
cMCU-ΔNTD and (15% ¹³C)-labelled cMCU-ΔNTD. a, Sample
¹H-¹H strips at various ¹⁵N chemical shifts showing intermonomer NOEs
between backbone amide proton and aliphatic protons for the C-terminal
CCH domain. The NOE spectrum was recorded at 23°C at 900 MHz.
b, Sample strips showing intermonomer NOEs within the TMH2 pore as
well as the selectivity filter region. The NOE spectrum was recorded at
33°C at 900 MHz. On the right of each panel, intermonomer NOEs in the
context of the structure are shown as red lines.
Extended Data Figure 6 | Structural ensemble of the cMCU-ΔNTD
pentamer derived from NMR restraints and fitting to the EM density
map. a, Ensemble of the 15 lowest-energy structures calculated using
NMR-derived structural restraints (see Extended Data Table 1). The
unstructured loops L1 (residues 166-179) and L2 (residues 272-292) are
not shown for clarity. b, The NMR structure of cMCU-ΔNTD without
the loop regions L1 and L2 was fitted to the EM volume using rigid body
fitting (the ‘fit’ tool in Chimera). The L1 and L2 are however included for
display. Top view from the intermembrane space side. c, Bottom view from
the matrix side. d, e, Two different side views. Note that the loop regions
appear disordered due to the lack of NMR-derived structural restraints.
This does not necessarily mean that they do not acquire any stable
conformation. The presumed detergent molecules around the membrane-embedded
region are not taken into account in this fit.
Extended Data Figure 7 | SDS-PAGE analysis of chemical crosslinking
of the CCH peptide. The reaction mixtures containing 0.1 mM peptide
(cMCU residues 288-316 plus the C-terminal L and E as in the cMCU-ΔNTD
construct) and various amounts of DTSSP were quenched after
1 h of reaction by the addition of 1 µl of 1 M Tris, pH 7.5. The quenched
samples were loaded onto a 12% Bis-Tris gel (Novex Life Technologies).
The gel was silver stained using the standard protocol. The four lanes to
the right of the molecular mass marker correspond to DTSSP:CCH ratios
of 0, 10:1, 30:1 and 50:1.
Extended Data Figure 8 | Surface representation for revealing the
surface-exposed and core amino acid properties of the cMCU-ΔNTD
pentamer. Hydrophobic, polar and charged residues are shown in yellow,
cyan and blue, respectively. The hydrophobic residues include A, I, L, F,
V, P and G, the polar residues include Q, N, H, S, T, Y, C, M and W, and
the charged residues include K, R, D and E. The solid lines indicate the
hydrophobic core boundaries of the presumed lipid bilayer. a, Pentamer
with unstructured loops removed. b, The same view as in a but with the
front subunit removed to reveal the core.
Extended Data Figure 9 | Deletion of the NTD (amino acids 58-186) in
HsMCU (HsMCU-ΔNTD) does not impair its function. Representative
traces of Ca²⁺ uptake in digitonin-permeabilized cells after addition of
50 µM CaCl₂ are shown on the left. The bar graph shows the rate of Ca²⁺
uptake relative to wild-type HEK-293T cells (mean ± s.d., n = 4). Cell
lysates were analysed by immunoblotting using an anti-Flag antibody to
detect expression of MCU protein. ATP5A was used as loading control.
Extended Data Table 1 | NMR and refinement statistics for protein structures

                                              cMCU-ΔNTD
NMR distance and dihedral constraints
  Distance constraints
    Total NOE                                 2440
    Intra-residue                             130
    Inter-residue                             2310
      Sequential (|i-j| = 1)                  1280
      Medium-range (|i-j| < 4)                660
      Long-range (|i-j| > 5)                  150
      Intermolecular                          220
    Hydrogen bonds                            0
  Total dihedral angle restraints             990
    phi                                       495
    psi                                       495
Structure statistics
  Violations (mean and s.d.)
    Distance constraints (Å)                  0.231 ± 0.002
    Dihedral angle constraints (°)            1.485 ± 0.034
    Max. dihedral angle violation (°)         9.511
    Max. distance constraint violation (Å)    1.487
  Deviations from idealized geometry
    Bond lengths (Å)                          0.034 ± 0.000
    Bond angles (°)                           1.777 ± 0.006
    Impropers (°)                             1.150 ± 0.016
  Average pairwise r.m.s.d.** (Å)
    Heavy                                     1.503
    Backbone                                  0.920
The numbers of constraints are summed over all five subunits. Backbone φ and ψ restraints of −60° and −40°, respectively, were assigned for regions of the protein confirmed to be helical based on
local NOEs and TALOS+. Statistics are calculated and averaged over an ensemble of the 15 lowest-energy structures out of 150 calculated structures. The precision of the atomic coordinates is defined
as the average r.m.s.d. between the 15 final structures and their mean coordinates. The calculation includes only the structured regions of the protein: residues 180-192, 194-271 and 292-316.
Extra-helical binding site of a glucagon receptor antagonist
Glucagon is a 29-amino-acid peptide released from the α-cells of the
islet of Langerhans, which has a key role in glucose homeostasis¹.
Glucagon action is transduced by the class B G-protein-coupled
glucagon receptor (GCGR), which is located on liver, kidney,
intestinal smooth muscle, brain, adipose tissue, heart and pancreas
cells, and this receptor has been considered an important drug
target in the treatment of diabetes. Administration of recently
identified small-molecule GCGR antagonists in patients with
type 2 diabetes results in a substantial reduction of fasting and
postprandial glucose concentrations². Although an X-ray structure
of the transmembrane domain of the GCGR³ has previously been
solved, the ligand (NNC0640) was not resolved. Here we report the
2.5 Å structure of human GCGR in complex with the antagonist
MK-0893 (ref. 4), which is found to bind to an allosteric site outside
the seven transmembrane (7TM) helical bundle in a position
between TM6 and TM7 extending into the lipid bilayer. Mutagenesis
of key residues identified in the X-ray structure confirms their role
in the binding of MK-0893 to the receptor. The unexpected position
of the binding site for MK-0893, which is structurally similar to
other GCGR antagonists, suggests that glucagon activation of the
receptor is prevented by restriction of the outward helical movement
of TM6 required for G-protein coupling. Structural knowledge of
class B receptors is limited, with only one other ligand-binding
site defined, that of the corticotropin-releasing hormone receptor 1
(CRF₁R), which was located deep within the 7TM bundle⁵.
We describe a completely novel allosteric binding site for class B
receptors, providing an opportunity for structure-based drug
design for this receptor class and furthering our understanding of
the mechanisms of activation of these receptors.

To obtain a high-resolution structure of the human GCGR transmembrane
domain (TMD), a thermostabilized receptor (StaR) was
generated containing 11 amino-acid substitutions. To facilitate
crystallization further, the extracellular domain was removed from the N
terminus (residues 2-135), and the C terminus was truncated by 60
residues (residues 418-477). Finally, T4-lysozyme (T4L) was inserted
into intracellular loop (ICL)2 between Leu255 and Pro259 (Fig. 1a),
resulting in the construct designated GCGR-StaR(136-417)-T4L.
The structure was solved in the presence of the antagonist MK-0893
(Fig. 1b and Extended Data Table 1). The construct modifications did
not alter the antagonist-binding properties of the receptor compared
with wild type (Extended Data Table 2).

The core fold of the receptor features the canonical 7TM helices
(TM1-TM7) (Fig. 1c) in a similar conformation to the previously
published GCGR structure³. Continuous density is observed for
intracellular loops ICL1 and ICL3, while extracellular loop (ECL)2 adopts a
conformation capping the entrance to the orthosteric site. In contrast
with the previously published GCGR structure³, the N terminus of TM5
unwinds by one helical turn, permitting ECL2 to stretch across to the
central axis of the TM helical bundle, mediating interactions from TM3
across to TM6 and TM7 while maintaining the conserved disulfide
bond between Cys224³·²⁹ᵇ and Cys294 (numbers in superscript refer to
the modified Ballesteros numbering system for class B GPCRs). The
highly conserved sequence motif GWGxP in TM4 of class B receptors
has an important structural role supplying interactions stabilizing the
configuration of TM2, TM3 and TM4 (ref. 5). In this GCGR structure,
TM4 bulges at Gly271⁴·⁴⁹ᵇ; together with Pro275⁴·⁵³ᵇ, this disrupts
intra-helical hydrogen bonding and positions Trp272⁴·⁵⁰ᵇ
towards TM2 and TM3, where it forms a hydrogen bond with Asn179²·⁵²ᵇ
on TM2 in an analogous fashion to CRF₁R. In addition, a hydrogen
bond from the side chain of Tyr233³·³⁸ᵇ to the backbone carbonyl of
Trp272⁴·⁵⁰ᵇ further strengthens the TM4-TM3 interaction.
Unexpectedly, strong and unambiguous density is observed for the
MK-0893 antagonist outside the 7TM helical bundle (Fig. 1c-e),
straddling TM6 from within the lipid bilayer. TM6 then acts to divide the
binding site into two distinct regions: a hydrophobic interface with
TM5, and a polar cleft towards TM7. The different physicochemical
properties of this bipartite antagonist pocket correspond to the dual
hydrophilic/hydrophobic nature of the ligand. The apolar
methoxynaphthalene moiety of the small molecule makes hydrophobic
contacts in the TM5-TM6 interface with Leu329⁵·⁶¹ᵇ, Phe345⁶·³⁶ᵇ,
Leu352⁶·⁴³ᵇ, Thr353⁶·⁴⁴ᵇ and the alkyl chain of Lys349⁶·⁴⁰ᵇ (Fig. 2a).
On the opposite side, within the TM6-TM7 cleft, MK-0893
participates in a network of polar contacts: the ligand amide group
hydrogen bonds with Lys349⁶·⁴⁰ᵇ and Ser350⁶·⁴¹ᵇ, while the carboxyl group
forms a salt bridge with Arg346⁶·³⁷ᵇ. This moiety also makes additional
polar interactions with Asn404⁷·⁶¹ᵇ, the backbone of Lys405 (located
between TM7 and H8) and a water-mediated hydrogen bond with
Ser350⁶·⁴¹ᵇ and Leu399⁷·⁵⁶ᵇ (Fig. 2a). The phenylethylpyrazole core of
the molecule makes further interactions with TM6, in particular with
Thr353⁶·⁴⁴ᵇ and Lys349⁶·⁴⁰ᵇ. The position of the pyrazole moiety parallel
to the membrane provides the two ligand vectors towards the bipartite
antagonist sub-pockets. A third vector starting from the ligand pyrazole
ring leads to the dichlorophenyl group, which makes a crystal contact
to TM4 of a symmetry mate (see Extended Data Fig. 1). Molecular
dynamics simulations of MK-0893 binding to the wild-type receptor
(outside the constraints of the crystal system) demonstrate that the
receptor-ligand interaction is stable, and MK-0893 remains at the
cytoplasmic membrane boundary with the carboxyl moiety able to interact
with intracellular solvent and with interactions between the ligand and
receptor, involving Arg346⁶·³⁷ᵇ, Lys349⁶·⁴⁰ᵇ, Asn404⁷·⁶¹ᵇ and Lys405,
maintained (Extended Data Fig. 2). Sequence conservation analysis
demonstrates that, with the exception of Thr353⁶·⁴⁴ᵇ and Phe345⁶·³⁶ᵇ,
the other key binding-site residues show good conservation across
other members of human class B receptors (Extended Data Fig. 5).
Figure 1 | Structure of GCGR and the MK-0893 allosteric binding site.
a, Crystallization construct showing stabilizing mutations (green), binding
site residues (pink), disordered residues not located (grey), and the
disulfide bond between Cys224³·²⁹ᵇ and Cys294 (yellow line). b, Chemical
structure of MK-0893. c, d, Ribbon representation of GCGR (blue), viewed
parallel to the membrane (c) and from extracellular space (d). The position
of the T4L insertion to ICL2 is indicated. MK-0893 in stick representation
with carbon, nitrogen, oxygen and chlorine atoms coloured yellow, blue,
red and green, respectively, Fo − Fc OMIT density contoured at 2.5σ.
e, Surface representation of GCGR embedded within the membrane
(20 ns molecular dynamics simulation), lipids in proximity of MK-0893
removed for clarity.

Tritium-labelled MK-0893 was prepared and used to characterize
the ligand-binding site. Membrane fractions prepared from HEK293T
cells expressing the wild-type GCGR were used in saturation binding
analysis. Saturation binding was monophasic and best fitted to a
one-site model with a linear Scatchard plot, consistent with the
presence of a single high-affinity binding site (Fig. 3a). Competition with
[³H]MK-0893 was used to characterize the binding of a number of
reported GCGR antagonists (see Extended Data Fig. 3 for details).
These compounds were selected to represent molecules that exhibit
chemical similarity to MK-0893 (for example, NNC0640, Cpd-01, 02,
03 and 04), as well as those that are chemically distinct (for example,
Cpd-05 and 06). Consistently, NNC0640, Cpd-01, 02, 03 and 04 were
able to fully compete with [³H]MK-0893 binding, indicating that these
compounds share the same binding site (Fig. 3b-f). By contrast, Cpd-05
and Cpd-06 were not competitive with [³H]MK-0893, indicating that
these compounds bind to a different site (Fig. 3g, h). Furthermore,
[³H]MK-0893 was not displaced by glucagon or the related peptide
antagonist des-His1-[Glu9]-glucagon (Fig. 3i, j), which bind at the orthosteric
site. Interestingly, NNC0640 was the ligand used in the crystallization of
the first reported GCGR structure³; however, the position of the ligand
was not resolved. A clear peak is observed in the electron density map
of GCGR-NNC0640 between TM6 and TM7 towards the intracellular
side of the receptor. Although this was modelled as a polyethylene
glycol (PEG) molecule in the reported structure, superposition with
the GCGR structure reported here demonstrates that the amide moiety
and carboxyl function of MK-0893 directly overlay this peak
(Extended Data Fig. 4). Given the chemical similarity of these ligands,
coupled with our competition data, it is likely that this constitutes
residual signal from NNC0640 binding in an analogous position to
MK-0893 on GCGR rather than being a PEG molecule.
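
The one-site model and linear Scatchard plot referred to in the saturation-binding analysis can be sketched as follows. For a single class of sites, bound ligand B = Bmax·F/(Kd + F), which rearranges to B/F = (Bmax − B)/Kd, so a plot of B/F against B is a straight line with slope −1/Kd and x-intercept Bmax. The parameter values and function names below are hypothetical and chosen only for illustration; they are not the values measured for [³H]MK-0893.

```python
def one_site_binding(free, bmax, kd):
    """Specific binding for a single class of non-interacting sites."""
    return bmax * free / (kd + free)


def linear_fit(xs, ys):
    """Ordinary least-squares line through (xs, ys); returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def scatchard_estimate(free, bound):
    """Estimate (Kd, Bmax) from the Scatchard line B/F = Bmax/Kd - B/Kd."""
    ratios = [b / f for b, f in zip(bound, free)]
    slope, intercept = linear_fit(bound, ratios)
    kd = -1.0 / slope
    bmax = intercept * kd  # x-intercept of the Scatchard line
    return kd, bmax


# Hypothetical, noise-free example (units: nM for F and Kd, pmol/mg for B and Bmax).
KD_TRUE, BMAX_TRUE = 2.0, 1.5
free_conc = [0.25, 0.5, 1.0, 2.0, 4.0, 8.0, 16.0]
bound = [one_site_binding(f, BMAX_TRUE, KD_TRUE) for f in free_conc]
kd_est, bmax_est = scatchard_estimate(free_conc, bound)
print(f"Kd = {kd_est:.2f} nM, Bmax = {bmax_est:.2f} pmol/mg")
```

With real data, curvature in the Scatchard plot would argue against a single site, and direct nonlinear fitting of the saturation curve is generally preferred for parameter estimation; the linear transform is shown here only because the text refers to it.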

To confirm the allosteric pocket identified in the structure, single
point mutations were made to residues in the binding site and the
binding of [³H]MK-0893 was subsequently assessed. The mutations
were introduced in the full-length wild-type receptor with a C-terminal
enhanced green fluorescent protein (eGFP) tag. The presence of the
eGFP tag has no impact on MK-0893 binding or glucagon activation
of the receptor (data not shown). For this analysis, Arg346⁶·³⁷ᵇ was
mutated to alanine or glutamic acid. Lys349⁶·⁴⁰ᵇ was mutated to either
alanine or methionine. Ser350⁶·⁴¹ᵇ and Asn404⁷·⁶¹ᵇ were changed
to alanine. These residues were selected as they participate in polar
interactions with the ligand and thus were considered more likely to
have a measurable effect on ligand binding (Fig. 2a). In addition, these
residues interact with the amide and carboxyl functions of MK-0893,
which have been demonstrated to be critical for the antagonist activity
of the ligand⁴. Fluorescence-activated cell sorting (FACS) analysis using
an antibody to the extracellular surface of the receptor was performed to
compare cell surface expression of mutants with the wild-type receptor.
None of the mutations caused any reduction in the cell surface
expression levels of the receptor nor had any effect on the binding affinity
of glucagon peptide for the orthosteric site (Extended Data Fig. 6 and
Extended Data Table 2). Mutation of Arg346⁶·³⁷ᵇ to glutamic acid or
Asn404⁷·⁶¹ᵇ to alanine reduced binding levels to close to undetectable
above non-specific binding, while mutation of Lys349⁶·⁴⁰ᵇ to alanine
or methionine, and mutation of Arg346⁶·³⁷ᵇ to alanine, significantly
reduced the binding affinity of [³H]MK-0893 (Fig. 2b and Extended
Data Fig. 7). Consistently, these mutations also reduced the ability of
MK-0893 to antagonize the glucagon-mediated increase in cAMP
(Extended Data Fig. 8).

Figure 2 | Confirming the MK-0893-binding site in GCGR. a, Diagram
of ligand interactions in the MK-0893-binding site. Hydrogen bonds are
depicted as dashed red lines with distances between heavy atoms in Å.
GCGR in ribbon representation is coloured blue, MK-0893 in stick
representation is coloured as per Fig. 1. b, Comparison of pKd of wild
type (WT) with the mutants. Data are average of three independent
experiments and error bars represent standard error of the mean (s.e.m.).
P values are calculated from a two-tailed t-test. NS, not significant. The data
set for R346E and N404A did not fit the one-site binding unambiguously
due to near complete loss of specific binding. ND, not determined.

The MK-0893 binding mode suggests that the ligand acts as a clamp
holding TM6 in the inactive state and hampering receptor
conformational changes required for G-protein coupling (Fig. 4a, c). In the case
of class A receptors, activation results in a rotation of TM6 in response
to agonist binding, which is transmitted through a rigid-body
movement amplified along TM6, thereby altering the interface between TM5
and TM6 and leading to an outward movement of the cytoplasmic end
of TM6 to enable G-protein binding. So far, no active structures of
class B receptors have been reported; however, the ability of MK-0893
to block signalling of GCGR points to a similar critical role of TM5
and TM6 in class B receptor activation. This is consistent with the
observation that, despite divergence in the extracellular arrangement
of TM helices between class A and B receptors, very good structural
conservation exists on the intracellular sides in the inactive
conformation. An extra-helical binding site was recently described
for 1-(2-(2-(tert-butyl)phenoxy)pyridin-3-yl)-3-(4-(trifluoromethoxy)phenyl)urea
(BPTU), an allosteric antagonist of the P2Y₁ receptor,
although in this case the binding site was located between TM1, 2 and 3
(Fig. 4b). It is likely that BPTU inhibits the movement of TM3 that is
also critical in receptor activation. Together, these structures
demonstrate that modulation of helical movements from membrane-proximal
surfaces represents an alternative way of modifying the activity of
GPCRs.

Figure 3 | Pharmacology of MK-0893. a, Saturation binding of
[³H]MK-0893 to membranes containing wild-type GCGR. Inset shows
the Scatchard plot. b-j, Representative competitive binding data
for inhibition of [³H]MK-0893 binding to membranes containing
wild-type GCGR in the presence of indicated compounds (for chemical
structures, see Extended Data Fig. 3). Values represent an average of
at least three independent experiments ± s.e.m. Kd, dissociation
constant. pKi, negative logarithm of the inhibition constant. Bmax, total
concentration of receptors. B and F in the Scatchard plot denote bound
and free ligand concentrations, respectively.

Figure 4 | MK-0893 allosteric mechanism of action. a, b, Ribbon
representation of GCGR (blue) and P2Y₁ receptor (P2Y₁R; yellow)
respectively, viewed parallel to the membrane. MK-0893 in stick
representation is coloured as per Fig. 1. BPTU is shown in stick
representation with carbon, nitrogen, oxygen and fluorine atoms coloured
yellow, blue, red and grey respectively. Potential restrictions on TM
movements are indicated. c, View of cytoplasmic side of GCGR (rotated
90° from a) superposed with the β₂-adrenoceptor (β₂-AR) in complex
with carazolol (grey) (Protein Data Bank (PDB) accession 3NY9) and the
β₂-AR-Gs complex (orange) (PDB accession 3SN6). d, Schematic overview
of known binding positions of class A, B and C GPCR ligands.

The identification of an allosteric binding site in the class B GCGR
located outside the canonical helical bundle further adds to the diversity
of interactions now known to occur between GPCRs and their ligands
in modifying receptor activation states (Fig. 4d). The GCGR-MK-0893
structure provides insight into the activation mechanism of class B
receptors, as well as facilitating the application of structure-based
drug design strategies to discover compounds with improved
qualities. Strong conservation of this binding site across human class B
receptors indicates that the structural information provided here can be
applied to other members of this medically relevant family of GPCRs.
In future, drug design paradigms must consider that, in addition to the
orthosteric binding pocket, extra-helical binding sites accessed from
the membrane or within the cell may provide alternative strategies to
modulate receptor function.


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
Extended Data Figure 1 | Packing interactions in the
GCGR-StaR(136-417)-T4L primitive orthorhombic crystal system. a, View
along b-axis, GCGR-StaR TMD in blue ribbon representation, T4L in
magenta ribbon representation, MK-0893 in stick representation with
carbon, nitrogen, oxygen and chlorine atoms coloured yellow, blue, red
and green, respectively. b, View as in a rotated 90° to view along c-axis.
c, Close-up of the hydrophobic/shape complementarity interaction of
the dichlorophenyl ‘head’ group of MK-0893 with residues on TM4 of a
symmetry-related copy.
Extended Data Figure 2 | Molecular dynamics analysis of the
GCGR-MK-0893 complex. a, Structural alignment of the wild-type
GCGR-MK-0893 complex at 0, 4, 8, 12, 16 and 20 ns molecular dynamics,
colour-coded from blue to red as indicated by the arrow. Protein is
shown as ribbon, MK-0893 as sticks. b, Extracellular view of the ligand
conformations during the molecular dynamics simulation, colour-coded
as in a. c, Root-mean-squared deviation (r.m.s.d.) in Å for protein Cα
(blue) and ligand heavy atom (pink) during the simulation, after structural
alignment to the initial model. d, Number of protein-ligand contacts
during the simulation: top, fluctuation of the total number of contacts
over time; bottom, individual residues interacting with the ligand at a
particular time are shown as rectangles colour-coded based on the
number of contacts, from white (no contacts) to dark red (four contacts).
e, Two-dimensional representation of the ligand-protein contacts.
Green circles represent hydrophobic, cyan represents polar and purple
are charged residues. Interactions that occur more than 20% of the
simulation time in the selected trajectory are shown and the percentage
frequency is marked. Hydrogen bonds are shown as pink arrows.
A π-cation interaction is shown as a red line. The part of the ligand buried
within the membrane is surrounded by a continuous line, while the
water-exposed atoms (corresponding to the propionic acid moiety) are
indicated by grey circles. f, GRID (Molecular Discovery) analysis of
the ligand-binding site. The shape of the pocket is shown as grey mesh,
hydrophilic and hydrophobic hotspots are shown using green and yellow
transparent surfaces, respectively. This panel was prepared in Vida
(OpenEye).
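
The r.m.s.d. traces described for the simulation compare each trajectory frame with the initial model after alignment. A minimal sketch of that calculation is given below; it assumes the two coordinate sets are already superposed and listed in the same atom order, and the function name and coordinates are hypothetical rather than taken from the simulation.

```python
import math


def rmsd(reference, frame):
    """Root-mean-squared deviation between two pre-aligned coordinate sets.

    Each argument is a list of (x, y, z) tuples in Å, in the same atom order.
    """
    if len(reference) != len(frame) or not reference:
        raise ValueError("coordinate sets must be non-empty and equal in length")
    squared = sum(
        (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
        for (x1, y1, z1), (x2, y2, z2) in zip(reference, frame)
    )
    return math.sqrt(squared / len(reference))


# Hypothetical three-atom example: every atom is displaced by 1 Å along z.
initial_model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
later_frame = [(0.0, 0.0, 1.0), (1.0, 0.0, 1.0), (0.0, 1.0, 1.0)]
deviation = rmsd(initial_model, later_frame)
print(f"r.m.s.d. = {deviation:.2f} Å")
```

In practice the superposition step (for example a least-squares fit on Cα atoms) is performed first by the analysis software; it is omitted here to keep the sketch short.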
Extended Data Figure 3 | Chemical structures of compounds analysed in this study.
Extended Data Figure 4 | Structural superposition of GCGR structures.
a, Structural superposition of the GCGR-StaR(136-417)-MK-0893
structure and GCGR-NNC0640 structure (PDB accession 4L6R) in
blue and orange ribbon representation, respectively. b, Close-up view
of the intracellular ends of TM5, TM6 and TM7 in 4L6R, as viewed
from within the membrane. A PEG molecule modelled in the 4L6R
coordinates is shown in stick representation with carbon coloured grey
and oxygen coloured red. m|Fo| − D|Fc| map calculated using the
4L6R structure factors and 4L6R coordinates with the PEG molecule
omitted, difference density is rendered at 2.5σ. c, View as in b, with the
GCGR-StaR(136-417)-MK-0893 structure overlaid with MK-0893 in
stick representation with carbon, nitrogen, oxygen and chlorine atoms
coloured yellow, blue, red and green, respectively. d, Representation as in c,
close-up and tilted towards the view from the cytoplasm.
                   5.61   6.36   6.39   6.43   6.44
GCGR (P47871)      L329   F345   A348   L352   T353
CRF₁R (P34998)     L294   R310   V313   L317   V318
CRF₂R (Q13324)     L290   R306   V309   L313   V314
CALCR (P30988)     L339   L355   V358   M362   1363
CALRL (Q16602)     L316   M332   V335   L339   1340
GIPR (P48546)      L321   L337   A340   L344   1345
GLP1R (P43220)     V331   C347   A350   L354   1355
GLP2R (O95838)     L365   Y381   A384   L388   V389
PACR (P41586)      L331   L349   A352   L356   L357
VIPR1 (P32241)     L319   S337   A340   L344   L345
VIPR2 (P41587)     L306   K324   A327   L331   L332
SCTR (P47872)      L320   K338   A341   L345   L346
GHRHR (Q02643)     L307   W325   S328   L332   F333
PTH1R (Q03431)     L385   R404   L407   L411   V412
PTH2R (P49190)     L340   R359   A362   L366   V367

                   6.34   6.37   6.40   6.41   7.56   7.61   7.60
GCGR               Y343   R346   K349   S350   L399   N404   L403
CRF₁R              Q308   K311   K314   A315   F362   N367   L366
CRF₂R              Q304   K307   K310   A311   F358   N363   F362
CALCR              M353   K356   K359   A360   1406   N411   C410
CALRL              L330   K333   R336   A337   1383   N388   F387
GIPR               Y335   R338   R341   S342   L391   N396   1395
GLP1R              1345   R348   K351   S352   L401   N406   V405
GLP2R              Y379   R382   K385   S386   Q435   N440   A439
PACR               1347   R350   R353   S354   L399   N404   L403
VIPR1              P335   R338   R341   S342   L387   N392   L391
VIPR2              Q322   R325   K328   S329   L374   N379   L378
SCTR               H336   R339   R342   S343   L387   N392   L391
GHRHR              Q323   R326   K329   S330   L375   N380   L379
PTH1R              Q402   K405   K408   S409   1458   N463   C462
PTH2R              Q357   K360   K363   S364   1412   N417   C416

Extended Data Figure 5 | Conservation of residues in the MK-0893
bipartite allosteric pocket. a, Sequence alignment of 15 human GPCR
class B members across residues constituting the MK-0893 bipartite
binding site. UniProt accession numbers are given. b, The GCGR
MK-0893 allosteric binding site is shown in surface representation with
residues in a coloured in rainbow spectrum according to conservation
level (red = 100%; blue = 0%). MK-0893 is in stick representation with
carbon, nitrogen, oxygen and chlorine atoms coloured purple, blue, red
and green, respectively.
Extended Data Figure 6 | Expression of GCGR mutants. Cell-surface
expression of GCGR mutants determined using FACS. Data are expressed
as the ratio of APC (cell surface expression) to GFP (total expression)
and calculated as percentage of wild type. Experiments were carried out
in triplicate and error bars indicate s.e.m. P values are derived from an
unpaired two-tailed t-test.
Extended Data Figure 7 | Saturation binding analysis of mutants with
[³H]MK-0893. a-g, Saturation binding of [³H]MK-0893 to membranes
containing the indicated variants of GCGR. Data are representative
of three independent experiments. pKd values are average of three
independent experiments and error bars represent s.e.m. P values are
calculated from a two-tailed t-test. The data set for R346E and N404A did
not fit the one-site binding unambiguously due to near complete loss of
specific binding.
Extended Data Figure 8 | Effect of binding-site mutations in functional
assay. Cells expressing either wild type or the indicated mutants were
stimulated with a concentration range of glucagon in the presence of
increasing concentrations of MK-0893. After determination of levels
of cAMP generated, the data were analysed by global fitting to the
half-maximum effective concentration (EC50) allosteric shift equation.
pKB is the negative log of the antagonist affinity that mediates the
allosteric inhibition of the glucagon response. Data are an average of three
independent experiments and error bars represent s.e.m. P values are
derived from an unpaired two-tailed t-test.
Extended Data Table 1 | Data collection and refinement statistics for
GCGR-StaR(136-417)-T4L-MK-0893

Data collection
  Number of crystals                      9
  Space group                             P2₁2₁2₁
  Cell dimensions
    a, b, c (Å)                           37.6, 71.5, 183.1
    α, β, γ (°)                           90.0, 90.0, 90.0
  Number of reflections measured          52,568
  Number of unique reflections            16,065
  Resolution (Å)                          32.73 - 2.50 (2.60 - 2.50)
  Rmerge                                  0.157 (0.768)
  CC1/2**                                 0.984 (0.469)
  Mean I/sd(I)                            6.0 (1.6)
  Completeness (%)                        91.1 (90.2)
  Redundancy                              3.3 (3.3)
Refinement
  Resolution (Å)                          19.97 - 2.50
  Number of reflections (test set)        15,222 (777)
  Rwork / Rfree                           0.226 / 0.263
  Number of atoms
    All                                   3,642
    Protein                               3,353
    Ligand                                41
    Others (lipids, ions, waters)         248
  Average B factors (Å²)
    All                                   66.7
    GCGR                                  47.8
    T4L lysozyme                          101.2
    Ligand                                37.2
    Others (lipids, ions, waters)         51.0
  RMSD
    Bond lengths (Å)                      0.0038
    Bond angles (°)                       0.799
  Ramachandran statistics
    Favored regions (%)                   97.8
    Allowed regions (%)                   2.2
    Outliers (%)                          0.0
  MolProbity overall score (percentile)   1.25 (100th percentile)

Values in parentheses indicate the highest resolution shell.
**CC1/2: see Diederichs & Karplus, Acta Cryst. (2013). D69, 1215-1222.
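
As a quick consistency check on the data-collection statistics, the redundancy (multiplicity) can be recomputed as the number of measured reflections divided by the number of unique reflections. The sketch below uses only the two reflection counts listed in the table; the variable names are illustrative.

```python
# Reflection counts as listed in the data-collection statistics.
reflections_measured = 52_568
reflections_unique = 16_065

# Redundancy (multiplicity): average number of observations per unique reflection.
redundancy = reflections_measured / reflections_unique
print(f"Redundancy = {redundancy:.1f}")
```

The result rounds to the overall redundancy reported in the table.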
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Extended Data Table 2 | Further pharmacological characterizations of different GCGR constructs 


a 
| | ane sconces. || gene 
(136-417) (136-417)-T4L 

b 


8.9 (0.01) | 8.86 (0.23) | 8.78 (0.04) | 8.84 (0.11) | 9.15 (0.01) | 9.00 (0.02) | 8.86 (0.01) 


a, Analysis of MK-0893 and Cpd-03 binding to different GCGR constructs. pKi values are calculated from competition studies with [³H]MK-0893 using membranes isolated from HEK293T cells transiently
expressing the indicated constructs. Data are shown as the mean of three independent experiments with standard deviation displayed in parentheses. b, Analysis of glucagon binding to
wild type and allosteric binding site mutants. pKd values are calculated from homologous competition studies with [¹²⁵I]glucagon using membranes isolated from HEK293T cells transiently expressing
the indicated constructs. Data are shown as the mean of two independent experiments with standard deviation displayed in parentheses.
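
Because the affinities in this table are reported on a negative-logarithm scale, it can help to convert them back to concentrations. The sketch below assumes the usual definition pK = −log₁₀(K), with K in mol/l, and uses 8.9, one of the values listed in part b, purely as an example; the function name is hypothetical.

```python
def pk_to_nanomolar(pk):
    """Convert a pK value (negative log10 of a constant in mol/l) to nM."""
    molar = 10 ** (-pk)
    return molar * 1e9


# Example with one of the tabulated values (pKd = 8.9).
kd_nm = pk_to_nanomolar(8.9)
print(f"pKd 8.9 corresponds to Kd = {kd_nm:.2f} nM")
```

A difference of one pK unit therefore corresponds to a tenfold change in affinity, which is a convenient way to read the mutant comparisons.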
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature16543 


Corrigendum: Essential roles
of PI(3)K-p110β in cell growth,
metabolism and tumorigenesis

Shidong Jia, Zhenning Liu, Sen Zhang, Pixu Liu, Lei Zhang, 
Sang Hyun Lee, Jing Zhang, Sabina Signoretti, Massimo Loda, 
Thomas M. Roberts & Jean J. Zhao 


Nature 454, 776-779 (2008); doi:10.1038/nature07091 


In Fig. 3b of this Letter we inadvertently used the wrong images (partial
duplicates of images representing p110α−/− mice) in the panels
representing p110β−/− mice. The corrected Fig. 3b is shown in Fig. 1 of this
Corrigendum. The conclusions of the paper are not affected; knocking
out either p110α or p110β does not affect the growth of the normal
prostate.


Panel labels: Control; p110α−/−; p110β−/−; Pten−/−; Pten−/− p110α−/−; Pten−/− p110β−/−


Figure 1 | This figure shows the corrected Fig. 3b of the original Letter. 


278 | NATURE | VOL 533 | 12 MAY 2016 
© 2016 Macmillan Publishers Limited. All rights reserved 


CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature16999 


Corrigendum: Structural and 
functional features of central 
nervous system lymphatic vessels 


Antoine Louveau, Igor Smirnov, Timothy J. Keyes,
Jacob D. Eccles, Sherin J. Rouhani, J. David Peske,
Noel C. Derecki, David Castle, James W. Mandell, Kevin S. Lee,
Tajie H. Harris & Jonathan Kipnis


Nature 523, 337-341 (2015); doi:10.1038/nature14432 


We would like to correct this Letter, which demonstrated the molecular 
characteristics and functional nature of lymphatics serving the central 
nervous system (CNS), by adding two reference citations, of which 
we became aware after publication. Földi et al.¹ postulated a CNS-lymphatic
link, although their findings described lymphatic vessels
located at the base of the skull, which differs from the location of the
vessels that we identified and functionally characterized. Andres et al.²
presented incidental histological evidence of dural vessels in rats that,
on the basis of ultrastructural features, appeared to be lymphatic. We
also wish to clarify that the designation of these lymphatics as ‘CNS
lymphatics’ in our Letter refers to their function of CNS/cerebrospinal
fluid drainage. 


1. Földi, M. et al. New contributions to the anatomical connections of the brain
and the lymphatic system. Acta Anat. 64, 498-505 (1966).

2. Andres, K. H., von Düring, M., Muszynski, K. & Schmidt, R. F. Nerve fibres and
their terminals of the dura mater encephali of the rat. Anat. Embryol. 175,
289-301 (1987).
CORRECTIONS & AMENDMENTS


CORRIGENDUM 
doi:10.1038/nature17177 


Corrigendum: Bees prefer 
foods containing neonicotinoid 
pesticides 


Sébastien C. Kessler, Erin Jo Tiedeken, Kerry L. Simcock, 
Sophie Derveau, Jessica Mitchell, Samantha Softley, 
Amy Radcliffe, Jane C. Stout & Geraldine A. Wright 


Nature 521, 74-76 (2015); doi:10.1038/nature14414


In this Letter, the author Amy Radcliffe was acknowledged but errone- 
ously omitted from the author list. We wish to include her in the author 
list as shown, associated with the Institute of Neuroscience, Newcastle 
University affiliation. She contributed data to the choice experiments 
with bumblebees using the pesticide thiamethoxam. This has been 
corrected in the online versions of the paper. 
Close inspection


To improve your own papers, learn how to evaluate 
other scientists’ work. 


BY QUIRIN SCHIERMEIER 


Before she had even defended her
doctoral thesis, Brazilian student Rita
Santos began to receive requests for
her expert opinion. Her work on beak devel- 
opment in octopus larvae — along with her 
knowledge, care and keen judgement — had 
left an impression on scientists in the field 
and early on in her career, she was invited to 
become a peer reviewer. 

Matthias Starck, a zoologist at the Ludwig 
Maximilian University of Munich in Germany 
and editor-in-chief of the Journal of Morphology, 
sent an invitation to Santos after receiving a
recommendation from her supervisor. “I was
a bit hesitant at first,” he says, “but the reports 
she turns in are just superbly thoughtful and 
well written.” 

Peer review is the backbone of modern 
science, and academic researchers are expected 
to participate in the endeavour. Although time 
consuming, delving deeply into someone else's 
paper can benefit a scientist's own work. The 
process allows peer reviewers to read about 
research before it is generally known and to 
gain insight into how other scientists write 
manuscripts and present data. “I’ve learned a
lot about science and the process of publishing
it,” says Santos, who studies marine ecosystems
at the Alfred Wegener Institute of Polar and
Marine Sciences in Bremerhaven, Germany. 
“And you learn how to be critical without being 
impolite or discouraging to others.” 

Whether or not they plan to pursue an 
academic career, junior researchers should get 
involved in peer review, says Sarah Blackford, 
a career adviser with the Society for Experi- 
mental Biology in London. “Not only will it 
help you to hone your power of judgement,” 
she says, “but it is also a great way to broaden 
your knowledge and demonstrate transfer- 
able skills for offering an authoritative view to 
your peers”. 


HAND-ME-DOWN PAPERS 

Young scientists typically get their start as 
reviewers through supervisors or lab leaders, 
who may be overburdened or need to turn to 
junior team members who are familiar with 
specific methods or technology. Graduate 
students generally are not recognized for their 
ability to conduct independent peer review 
unless, like Santos, they are already establish- 
ing an academic reputation by publishing 
first-author papers. But they can gain expe- 
rience by helping their supervisors or senior 
colleagues to prepare reviews. 

“If I am too busy, or a manuscript is a
little outside my field, there is nearly always 
an opportunity to propose postdocs and other 
early-career researchers who have expertise 
in the area requested,” says Ros Gleadow, a 
plant physiologist at Monash University in 
Melbourne, Australia. “They might then get 
invited by the journal to conduct the review.”

Even if they aren't invited, another 
natural first step is to review a paper jointly 
with seasoned colleagues or under their 
mentorship, says Emma Ganley, co-editor- 
in-chief of the journal PLoS Biology. Senior 
scientists might be better placed to judge a 
finding’s weight and general significance, but 
junior researchers are often more up to date on 
methods and technology — proficiencies that 
any journal editor will appreciate. 

“Young reviewers are extremely good in 
raising technical issues such as those related 
to microscopy or molecular techniques,” says 
Bernd Pulverer, head of scientific publications 
at the European Molecular Biology Organi- 
zation (EMBO) in Heidelberg, Germany. 
Editors of EMBO journals encourage senior 
reviewers to involve trusted early-career lab 
members in peer reviews, provided that they 
have done experimentation in the relevant 
field. Their background experience will
help them to carry out the key components
of peer review: they must be able to assess 
whether work is new to the field and original 
enough to deserve publication — and by the 
journal in question. 

They need to be able to evaluate the quality 
of data, look for potential inconsistencies and 
ascertain whether the methods and experi- 
ments are appropriate. If they see flaws or 
holes, they will be expected to suggest that the 
authors do more analysis or more experiments. 
And if they think that a paper is incomprehen- 
sible or biased (or plain tripe), they are obliged 
to tell journal editors just that. 

Reviewers will assess whether a study is 
conceptually valid or technically sound, if its 
arguments are coherent and if claims and con- 
clusions are sufficiently backed up by the data. 
Many journals will also ask whether the results 
challenge or confirm established concepts, and 
if they significantly advance the field at hand. 

“Don’t try to dictate to us what we should be
publishing, but do provide strong arguments 
and detailed justifications of any statements 
you make,” says Karl Ziemelis, chief physical- 
sciences editor at Nature. “Just saying that this 
or that isn’t a big deal in your field is much too 
vague. We would like to know why you think 
so, and how you came to that conclusion.” 


THE STARTING GATE 

New reviewers may be uncertain of what they 
are expected to produce and how overtly criti- 
cal they should be. “I knew I was to assess the 
scientific strengths and weaknesses of the 
manuscript,” says Santos. “But I wasn’t quite 
sure, at first, how deeply I should go into things 
like length, structure and language.” If they are 
at all confused, they should consult a seasoned 


Marine ecologist Rita Santos out in the field. 
NUTS AND BOLTS


Become a peer-review legend 


• Formal courses in peer review are rare or
absent, so seize the opportunity if lecturers
offer exercises in discussing papers. Journal
clubs are also a helpful way to gain some
experience.

• To become a reviewer, you need to make
yourself known. A good way to build up trust
with journal editors is to approach them at
conferences and meetings and show them
your work.

• Once you get a manuscript, read it
through once, carefully. Let it settle a day
before you proceed.

• Establish whether the science is
compatible with the scope of the journal.

• Outline the novelty of the science and
judge the significance of the results: how do
they advance the field?

• Comment on the quality of the science
and validity of the results.


reviewer, or contact the journal editor who 
commissioned the review, advises Ziemelis. 
They should also tell the editor if they feel that 
they might lack competence — or the time — 
to do a proper review. If the field in question 
is too distant from their own niche, they may 
need to decline to review a manuscript, or sug- 
gest someone who is more appropriate. 

“Do tell editors if you are happy to comment 
on one aspect of a paper but not on another,”
says Pulverer. Journal editors also appreciate 
it when a researcher recommends colleagues 
who might be better placed to evaluate a 
paper or any specific aspects of the science. If 
a peer reviewer brings in a student or technical 
specialist to help out, those people should be 
named as contributing reviewers. 

Similarly, Ziemelis says, researchers should 
tell the journal editor if they think that they are 
too closely affiliated with an author to judge the 
science neutrally. Any conflict of interest — per- 
sonal, financial or owing to direct competition 
— renders a scientist unsuitable as a reviewer. It 
is always better to over-declare than to under- 
declare, says Irene Hames, an independent 
publishing consultant in York, UK, and former 
director of an international organization called 
the Committee on Publication Ethics. 

Novice reviewers should also find out 
whether the journal offers ‘double-blind’ peer 
review, in which authors can request that their 
names and affiliations be withheld. A reviewer 
will need to decide whether she or he is com- 
fortable reviewing the work of an anonymous 
author. Conversely, in the case of ‘open’ peer
review, the author’s and reviewer's identities are 
disclosed. But this model offers new reviewers 
the chance to look at what others have written 
and how authors have responded to comments. 
• Ask yourself the following questions:
  Is the argument logical?
  Are the methods suitable and results plausible?
  Are the findings adequately described and discussed?
  Are the claims and conclusions justified by the data?
  Is the interpretation of the data appropriate in light of available theory?
  Have the authors conducted all appropriate controls?
  Is there adequate replication?
  Are key papers in the field cited?
• Give an opinion as to whether the paper
should be published, revised or rejected.
• Describe any extra experimentation or
data analysis needed to warrant publication.
• Ask journal editors for feedback: what was
your review like? Was anything missing? Q.S.


If a junior researcher is contacted by a journal
that they have never heard of, they should be
cautious. An invitation from what might be a
new or relatively unknown small journal isn't 
necessarily a reason to decline, but journals with 
questionable peer-review and publishing stand- 
ards are increasing in number. If a journal says 
that it is open access, researchers should check 
whether the journal is listed on the Directory
of Open Access Journals (www.doaj.org) or the 
Open Access Scholarly Publishers Association 
(www.oaspa.org). They should look for rec- 
ognized experts on a journal's editorial board, 
and contact them to verify credentials and peer- 
review standards. 


THE WRITE UP 

The review itself involves several steps (see 
‘Become a peer-review legend’). The first is 
to plan enough time and to stay in close con- 
tact with editors. There is no one-size-fits-all 
estimate for how long it takes to write a good 
review, but scientists should expect to spend at 
least eight hours and up to several weeks, say 
veteran reviewers. 

Sloppy work or unresponsiveness might 
prompt editors to drop a reviewer — which 
could mean losing the respect of peers and col- 
leagues and diminishing the chance of being 
added to editorial boards. It could also taint a 
researcher's reputation with editors of journals 
in which they may want in future to publish 
their own work. 

After an initial general read of the 
manuscript, novice reviewers should wait a 
full day or so before getting into the technical 
details and starting to draft a properly phrased 
review, says structural biologist Stephen Curry 
of Imperial College London. “Sit back and think
how you would like a constructive review
to be written if you were the author of the
paper,” he says. Snarkiness or scorn should
not be present. “Derisiveness, aggressiveness
or rivalry have absolutely no place in a
review,” Curry adds.

The document should start with a short, 
cohesive summary of the paper, says 
Pulverer, followed by comment on experi- 
mental design and the validity of controls. 
A key point of any review of biological 
work, he says, is whether the data and their 
interpretation support the reported findings.
“We’d like reviewers to outline precisely
what extra tests they think are needed
and why,” he says. Reviewers should also 
make clear whether more experiments are 
essential or merely desirable. 

The specific technical and editorial 
advice that reviewers are expected to pro- 
vide depends largely on the subject area and 
the scope of a journal. Validating a twist in
string theory or cosmology calls for a dif- 
ferent approach than reviewing the results 
of an astronomical observation, geological 
fieldwork or clinical trials. If asked to assess 
theoretical work, a reviewer should focus 
on equations and their interpretation. 

Most studies will require reviewers to
examine observational and experimental
data contained in supplementary material
(or external repositories) and their
representation in graphs and figures.
Reviewers should check guidelines
for authors and reviewers carefully
to be sure that they properly understand
a journal’s scope,
how novel and ‘big’ any science must be to
get published there, and whether referee
reports and the authors’ responses will be
published online.

If the latter is the case, as it is for the 
EMBO journals, scientists should look at 
other reviews and authors’ responses. This 
is a good way for novice reviewers to get a 
sense of the appropriate length and structure 
expected, and of the journal's overall review 
process, says Hames. If such information is 
ambiguous or unavailable, they should ask 
the journal for specifics. 

Assessing the work of others nurtures 
critical thinking in ways that few other 
ventures can match. But at the end of the 
day, says Alaa Ibrahim, an astrophysicist 
with the American University in Cairo, it is 
good for authors to have others dissect their 
submitted work. “The worst thing,” he says,
“is that your science gets published just to
be proven faulty or wrong soon after.”


Quirin Schiermeier is a Nature 
correspondent in Munich, Germany. 


TURNING POINT 


CAREERS 


Intelligence programmer 


Computer scientist Damien Anderson 
overcame a lengthy illness to pursue an award- 
winning PhD project in artificial-intelligence 
(AI) research at the University of Strathclyde
in Glasgow, UK. After regaining his health, he 
had to wrestle with a crisis of confidence. 


What led you to study AI?

I’ve been interested in computers since my 
dad bought a video-game console, when I was 
five. I grew up in a deprived area of Scotland 
called North Lanarkshire, so it was a big deal 
at the time. Later, I had serious health issues 
— undiagnosed pneumonia led to chronic- 
fatigue syndrome — which left me bed-bound 
from age 14 to 20. I replaced conventional 
education with the computer, teaching myself 
subjects that I was interested in. It was a negative 
time, but positive things came out of it. It gave 
me time to learn the things I wanted to learn. 


How did you move forward once you were well? 
When I had my strength back, I worked at a 
call centre fixing computers for four years. I 
decided that if I could do that, I was now physi- 
cally able to stick to a degree. My confidence 
had been zapped by being ill for so long. I 
wanted a piece of paper that said I was capable 
of doing more than answering phones. 


Can you describe your journey into university? 
The hardest decision I ever made was to go 
back into education. I didn't have high-school 
qualifications, and so I had to prove myself. I 
did a national qualification in the form of an 
introductory course to digital media, and then 
completed a two-year diploma at the City of 
Glasgow College — a gateway to university if 
you don’t have enough qualifications. I focused
on software-programming languages. After 
that, I was allowed to enter the University of 
Strathclyde as a second-year student. 


Were you intent on doing video-game design? 
Early on, yes. But when I got to the university's 
department of computer and information 
science, I was really impressed by the people and 
their projects, which included AI. I decided to 
do a software-engineering degree. But to be 
honest, my initial goal was just to get a degree. I 
approached it as if I just had to survive my time 
in university. It was a game of attrition, and I 
would beat it through pure persistence. 


What pushed you to do more? 

My undergraduate programme offered an 
optional placement year in industry. When I 
looked at the list of places that students had 
gone before, CERN, Europe's particle-physics
lab near Geneva, stood out. I was determined 
to do well — not just get through it. I studied 
harder to get the grades necessary. 


How was your time at CERN? 

It was a dream. I expected to be surrounded 
by Einsteins and Feynmans, but these are normal,
determined people like me, which was
eye-opening. We were using real-time decision- 
making processes, called scrum, to develop 
machine-protection software for the Large Had- 
ron Collider. After seven months there, I was 
named scrum master — essentially, team facili- 
tator. It made me feel extremely valued. I came 
back after 14 months and finished a final-year 
project that won 2 awards, which helped me to 
secure funding from the Carnegie Trust for the 
Universities of Scotland to conduct a PhD. 


What are you working on now? 

The big hurdle in my field now is building AI 
systems that are able to solve a variety of prob- 
lems, including ones they’ve never seen before. 
The Google DeepMind team — which just 
announced that its AI, called AlphaGo, won 
against the world’s top Go player — is also funding
a competition to build AIs able to solve more
than one problem. I’m working on that. One of 
the best platforms to carry out that project is in 
video games, because there are so many types — 
from puzzles to role-playing to strategy. 


How have you handled the attention that your 
work has received? 

The publicity has at times got me way out of my 
comfort zone, but it’s a great confidence boost. 
I’ve decided to say yes to every opportunity.


INTERVIEW BY VIRGINIA GEWIN 


This interview has been edited for length and clarity. 
VENICE, VERSION 9.0


BY PRESTON GRASSMANN 


Scrolling through different versions
of the city, I settle on a drowned
vision of the Venetian streets. I 
wait for her at a table on the make- 
shift patio of a sunken cathedral, 
watching the avatars of tourists 
pass. She comes in a silk dress 
that flows behind her like swirls 
of smoke, her copper hair like 
fire. The illusion is broken only 
by the swiftness of her motion. 
As she sits down, I stare at the 
curve of her neck, the elegance 
of an angle I could never quite 
capture in code. 

“A bit sacrilegious, don’t you 
think?” she asks, looking up at the 
cathedral window, where stained 
glass frames the dusk with a scene from 
the last supper. 

“In their version of Venice, the servers 
are acolytes walking the aisles with candles 
and contribution trays.” The waiter appears, 
placing a glass of wine at the table. “This is 
a sacrament.” 

“You've made some changes here.” She 
points up at a virtual vaporetto, the engine 
sounds muted as if heard from several 
metres below the surface of the Adriatic. It 
slows down at the edge of the pier, letting the 
tele-present tourists disembark. They step 
out into the sea, transform into their cho- 
sen avatars and begin their descent to the 
sunken city. 

“Thank you for noticing. I've added a few 
things to make the experience more real.” 

She rolls her eyes, looking up at the Bridge 
of Sighs, where a school of exotic fish drifts 
slowly past the windows. “Do you even 
know what that means?” 

“The wine is real,” I say. “I can't say much 
for the tourists, though.”

“This is all a game,” she says, leaning for- 
wards, raising the glass in her hands and 
studying the wine swirling inside. “Faith- 
ful algorithms made into virtual-estate, 
illusions bought and sold on a phantom 
market.” 

“Do you really think the pre-virtual cen- 
turies were all that different? None of these 
monuments would have been built without 
games of power and influence. Do you know 
how many versions of Venice there were 
before the virtual cities came along? The 
‘Queen of the Adriatic’, ‘City of Water’, ‘City
of Masks’, ‘City of Bridges’, ‘The Floating
City’... Shall I go on?”

“Why did you bring me here?” 

“What if I could bring back the old Venice,
the one from your own childhood?” 

She stops swirling her glass and stares 
at me over the rim. “What are you talking 
about?” 

The scene fades from view, colours bright- 
ening over the streets and buildings, resolv- 
ing into another version of the city. Sitting 
where we are, outside the cathedral, I let the 
new vision open slowly. This is the Venice 
of her memories, where she often walked 
with her mother when she was a child, the 
old storefronts with their handmade signs 
— shops that sold silver-plated tableware, 
perfumes, women’s hats and ornate glass, 
gleaming brightly behind the windows. She 
recognizes the name on the sign near the 
cathedral, the canal where she once lived. 

“No one inhabits this version yet, so the 
people you see here are default avatars.” 

“You've turned my memory into an algorithm,”
she says.

“I thought you would've liked this. This 
is the only version that will ever exist, out- 
side of your own memory.” 

“But it’s not real,” she says, tapping the
table with her finger. A metal sound
clanged beneath the
image of the wood. “You've assembled this
out of my head.”

“I’ve reconstructed part of it from your
memories. Other parts I’ve filled in
with recordings, pictures and public
records.”

“I don't want to live in my own
memories. You should have left
them where they were.”

“When you told me about
your memories of Venice, you
said that you wished you could
be back in that time. I thought
you would’ve liked this...”

“It’s not just about the time,
but the people who inhabit that
space. What you've created is only
a simulation. It’s not real.”

“Tell me, what is reality? Your
eyes filter everything but the narrowest
range of the light. Everything that
you hear is only a fragment of the sound
that’s around you at each moment. What
you call ‘real’ comes from a perception that
constantly changes, altered by chemistry and
memory.”

“Look,” she says, reaching out to pass 
her hand through me. “This is what I know 
is real. I can't feel you. I can’t touch you. 
There is nothing to hold on to.” She stands 
up slowly. She brushes a hand across her 
cheeks. The tears are like glass at the edge of 
her eyes, shaped by forces I can’t understand, 
insoluble in the virtual sea. “I need to be out 
there in the world, even if it is broken.”

I watch her walk away, her copper hair 
drifting in the wind of another Venice, the 
nape of her neck a question in the coded 
landscape of her memory. The only answer 
I have is that I am also an artefact assembled
from memory, an algorithm made out of 
silicon instead of biology. 

I nod to the waiter, placing the virtual 
money on the table. In another version of 
the world, I’ve placed an offering in a dona- 
tion tray. In her version, I don't exist. 

The waiter bows swiftly and turns away, 
walking down the pews of an old cathedral. 

Back in the sunken city, I stare up at the 
stained-glass window until the dusk falls.


Preston Grassmann became a freelance 
writer after working as a regular reviewer 
for Locus Magazine. His most recent 

work has been published in AE: Canadian 
Science Fiction, Daily Science Fiction, 
Mythic Delirium and Slave Stories: Scenes 
From the Slave State. 
Secrecy within the world of drug discovery and
development is no longer as important as it once was.

As development of therapies has become more difficult 
and costly, academics and industry competitors have begun 
to engage in greater collaboration and to embrace openness to 
accelerate research. 

The road to a new drug is littered with expensive dead ends. 
Costly mistakes are often hidden from view and made multiple 
times in different laboratories (see page S54). To reduce 
this inefficiency, competitors are working together on basic 
research in pre-competitive partnerships (S56). Early research 
is being aided by the release of tools to explore potential drug 
targets. The freely available chemical probe JQ1, for instance,
has sparked more innovation than it ever would have had it 
been kept locked away (S60). In the past decade, big pharma 
has come to accept that it must look for ideas beyond its walls 
if companies are to continue to innovate (S59). Hundreds 
of people, often without experience of the field, are entering 
competitions to solve complex biological problems (S62). And 
in an effort to screen the many compounds created each day, 
some companies have committed to examine the molecules 
without making claims over intellectual property (S65). 

Research of neglected and tropical diseases is some of the 
most open. Open-source projects are publishing every step, 
and misstep, for all to see (S68). Despite these developments, 
some say that the life sciences are not as open as they should 
be (S70). One Canadian institute is preparing to test whether 
abiding by the principles of openness, including refusing 
to patent any of its discoveries, can work in a world where 
academics are expected to extract commercial value from 
their work (S71). 

We are pleased to acknowledge the support of Boehringer 
Ingelheim in producing this Outlook. As always, Nature 
retains sole responsibility for all editorial content. 
A NEW CHAPTER IN INNOVATION


A growing appreciation that cooperation and competition can coexist is transforming the 
life-sciences innovation landscape. Development was once shrouded in secrecy, but now 
organizations are coming together. By David Holmes; illustration by Mohamed Ashour. 


STRAIGHT PATHS OF OLD 


Historically, life-sciences 
partnerships were linear and 
closed in nature. The 
conception, development and 
delivery of therapies was 
handled separately — the 
involvement of one partner 
ending as that of another 
began. 


BASIC RESEARCH 


This arrangement had its 
advantages. Risk and reward 
increased as the drug moved 
down the chain. But the lack 
of dialogue with the wider 
community could often result 
in costly duplications of effort. 


DISCOVERY & 
DEVELOPMENT 


TREATMENT 


MOORE PROBLEMS 


The number of drugs approved by the US Food and 
Drug Administration (FDA) per billion US dollars spent 
on research and development (R&D) has halved roughly 
every 9 years since 1950 — seemingly the opposite of 
Moore’s law (Gordon Moore, co-founder of technology 
company Intel, observed that the number of transistors 
that can be placed, inexpensively, on an integrated 
circuit doubles every 18-24 months)¹. In an effort to
defy ‘Eroom’s law’, organizations have been forced to 
search for ways to boost returns on investment. 
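
The arithmetic behind ‘Eroom’s law’ can be sketched in a few lines of Python. This is only an illustration: it assumes a clean exponential halving every 9 years from 1950, and the function names and the normalized starting value of 1.0 are placeholders rather than figures from the analysis cited above.

```python
def approvals_per_billion(year, base_year=1950, base_rate=1.0, halving_years=9.0):
    """Relative number of approved drugs per billion US dollars of R&D.

    Assumes an idealized exponential halving every `halving_years` years.
    `base_rate` is a placeholder normalization, not a measured value.
    """
    return base_rate * 0.5 ** ((year - base_year) / halving_years)


def transistor_growth(years_elapsed, doubling_months=24.0):
    """Growth factor in transistor count under Moore's law (idealized)."""
    return 2.0 ** (years_elapsed * 12.0 / doubling_months)


if __name__ == "__main__":
    # Print the relative R&D efficiency at a few sample years.
    for year in (1950, 1959, 1968, 2013):
        print(year, approvals_per_billion(year))
```

Under these assumptions, the 63 years from 1950 to 2013 contain seven halvings, which corresponds to a 128-fold fall in approvals per billion dollars, whereas the same exponential form with a doubling time describes the growth in transistor counts.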




FDA tightens regulation after 
the thalidomide scandal. 
FDA begins to
clear backlog 
of applications. 
SQUEEZED MARGINS


Increasing R&D costs and 
competition from drugs that are no 
longer covered by patents have 
squeezed the margins of the 
conventional linear innovation 
model. 


INCREASING INFORMATION 


As researchers better understand 
the biological complexity of targeted 
diseases, and the technological 
complexity of drug discovery and the 
size of data sets increase, 
organizations are becoming more 
reliant on outside expertise. 


UNMET NEEDS 


A conventional closed model of 
innovation is unsuitable for some 
medical needs, such as the 
development of antimalarials or 
Ebola vaccines. 


A GROWING MOVEMENT 


In response to the limitations of the closed model 
of innovation, a growing number of organizations 
have come together over the past two decades to 
share ideas, data and tools with each other, and 
often with the wider community. 


1998: Division of Signal Transduction Therapy
1999: Single Nucleotide Polymorphisms Consortium
2002: International HapMap Project
2004: Structural Genomics Consortium
2011: Open Source Malaria Project

OPEN TO INTERPRETATION 


OPEN INNOVATION


According to Google’s Ngram Viewer, which shows the relative frequency with which certain words and phrases are published, the phrase ‘open innovation’ gained 
prominence in the English language in 2003 — the year that business specialist Henry Chesbrough, now at the University of California, Berkeley, used it to describe 
the concept that companies should use external ideas as well as internal ones to advance their technology. But open innovation is just one of the terms that seek to
capture different, but overlapping, themes of openness in the broader innovation ecosystem.
Pioneered by the software-development industry, open-source projects
make the fruits of open collaboration
free to everyone, with little or no
direct profit for the collaborators.

Chart annotation: Henry Chesbrough uses the term open innovation.

... to the conventional publishing
model. Literature is digital, online,
free at the point of use and usually
free to reproduce in some form.
OPEN DATA


The Open Data Institute in 
London describes open 
data as data that anyone 
can access, use and share. 


Open science is the practice of 
science such that others can 
contribute. Lab notes are freely 
available, and the research and its 
underlying methods can be shared. 


DEGREES OF OPENNESS 


Whether or not a collaboration qualifies 
as an example of open innovation can be 
subjective, but criteria (pictured; see source 3)
devised by the Wellcome Trust in London 
can be used to categorize collaborations. 
It is generally agreed that the ideal open 
initiative would be a broad collaboration 
with public participation through all 
stages of innovative development, 
resulting in an output that is free to all. 


Whom a collaboration 
is open to can vary, 
from an internal 
collaboration within 
an organization to a 
project fully open to 
anyone. 


PARTICIPATION

5  Open to the public
4  International collaborations with multiple partner types
3  Multiple partner types in a similar geography or limited number of international partners
2  Limited number of partners (<10) within a similar geography
1  Internal collaboration within a single organization


Narrow focus within a single R&D stage (usually very early stage) 
Focus on early conceptualization (proof of mechanisms) 


Pre-clinical or clinical studies (not both) 


Pre-clinical and some clinical studies 


Pre-clinical, clinical and 
commercial development 


Openness can vary depending on the stage of the 
development process. Some projects are open until 
clinical studies begin, at which point access to data
becomes tightly controlled. Open collaboration in other
projects may persist through to final release.


For many, the right to access the fruits of a 
collaboration is central to whether it can be 
considered open. In some cases, individual partners 
retain intellectual-property rights, whereas in others 
results are made available to all. 


CASE STUDIES 


Intellectual property is shared and retained within the partnership 


Intellectual property generated before and as a result of 
the partnership is shared within the partnership only 


Some of the resulting intellectual property is protected 
and made available to the public 


A completely open approach is adopted in 
the partnership 


Sources: 1. Tufts Center for the Study of Drug Development; 2. J. W. Scannell et al. Nature Rev. Drug Discov. 11, 191-200 (2012); 3. R. Pigott et al. Shaping the Future of Open Innovation (Wellcome Trust, CASMI & Kinapse, 2014). 
Unlikely partnerships


Drug discovery is time-consuming and full of blind alleys. Pharmaceutical rivals are 
cooperating in the early stages to accelerate and improve the efficiency of the process. 


BY NEIL SAVAGE 


The protein ubiquitin, as its name
suggests, is found in almost all living
tissue. It plays an important part in
the death of old or damaged cells, by attaching
to other proteins and labelling them for
destruction. Failure to mark proteins in this 
way can lead to inflammation, cancer or 
neurological disorders such as Alzheimer’s 
disease. If scientists can unravel the myster- 
ies of this pervasive molecule, they may find 
new targets for drugs to treat these diseases. 
Pharmaceutical companies are already 
selling three drugs that target processes 
involving ubiquitin as a way to treat the 
bone-marrow cancer multiple myeloma: 
bortezomib, approved by US regulators in 
2003; carfilzomib, approved in 2012; and
ixazomib, which got the nod last November. 
But because the system for attaching and 
detaching the protein has so many mov- 
ing parts, including 2 activating enzymes, 
about 40 conjugating enzymes and some 
600 ligases, there may be many more thera- 
peutic targets still to be found. With so 
much to study, researchers at the Univer- 
sity of Dundee, UK, and six pharmaceutical 
companies are collaborating to share their 
resources and findings, in the hope of gain- 
ing insights that will lead to new drugs. “I 
think this is going to become pretty big,” 
says Philip Cohen, a biochemist at Dundee 
and one of the leaders of the collaboration, 
called the Division of Signal Transduction 
Therapy (DSTT). “The things we discover 
here will be helpful to alleviate disease and
also to generate a lot of money, not only for 
the companies, but for our own research.” 

The DSTT is a pre-competitive partner- 
ship — a type of collaboration in which 
pharmaceutical companies join together 
with one another, and often with academic 
researchers and the support of government 
funders, to tackle questions that they hope 
will lead to therapies. The idea is to share 
the cost of making early-stage discoveries, 
such as identifying biomarkers or disease 
pathways, that lay the groundwork for 
drug development. Armed with such basic 
knowledge, the companies can then iden- 
tify specific molecules that might make 
drugs and study them in-house, developing 
proprietary therapies. 


BRATISLAV MILENKOVIC 


FRIDA YGGESETH 


Beyond sharing costs, partnerships can 
also help with the sheer volume of biologi- 
cal information now collected. This will 
only continue to grow as DNA sequencing 
of individuals becomes more widely avail- 
able. “The work is very large, and no sin- 
gle company or academic group can do it 
alone,” says Sylvain Cottens, who heads 
the Center for Proteomic Chemistry at the 
Novartis Institutes for Biomedical Research 
in Basel, Switzerland. If academic and indus- 
try researchers can pool their resources and 
share skills, they may be able to improve effi- 
ciency and speed up the creation of therapies 
for a wide variety of diseases. 


THE COST OF FAILURE 
Most drug candidates go nowhere. In 2004, 
the US Food and Drug Administration 
(FDA) estimated that only 8% of the com- 
pounds that enter phase I trials — many of 
which have been in development for more 
than a decade — actually make it to market. 
In 2015, the industry group Pharmaceutical 
Research and Manufacturers of America put 
that figure at less than 12%. The average cost 
of developing a drug in the first decade of the 
twenty-first century was US$2.6 billion — up 
from an average of $1 billion in the 1990s (see 
‘Under pressure’). 

Pre-competitive partnerships could be a 
way to dramatically improve the efficiency 
of drug development. For starters, they could 
reduce the large amount of duplication. 
Companies conduct their research in secret 
and tend not to publish the results of failed 
studies, meaning that other groups are likely 
to follow the same fruitless lines of inquiry. 
“If ten companies are working on Alzhei- 
mer’s disease on exactly the same target and 
it’s failed, that’s ten times the investment that 
is down the tubes,” says molecular biologist
Pierre Meulien, who heads the Innovative
Medicines Initiative (IMI) in Brussels, a
partnership between the European Commission
and the European Federation of Pharmaceutical
Industries and Associations. Because
of the secrecy, it’s difficult to come up with
specific figures for duplication. But in 2009,
it was estimated that 85% of research is wasted
(amounting to $170 billion worldwide each year),
at least some of which is because of failed or
redundant studies (P. Glasziou & I. Chalmers
Lancet 374, 86-89; 2009).

Greater openness could reduce redundancy and save money, as well as
spare patients from enrolling in trials that are doomed to fail. But as
Cohen sees it, the real promise of pre-competitive partnerships is
improving our understanding of the biological mechanisms that underlie
a particular process. The DSTT’s focus on deubiquitinating enzymes,
which modify the effect of ubiquitin, is already helping to speed up the
discovery of candidate drugs. One company, Dundee-based Ubiquigent, has
already been formed to provide drug-development companies with assays
and reagents developed by researchers at the University of Dundee. Cohen
hopes that deubiquitinating enzymes will follow the path of kinase
inhibitors, which the DSTT also studies. Once the first kinase inhibitor,
imatinib, was approved by the FDA in 2001 to treat chronic myeloid
leukaemia, researchers began devoting more resources to them. Since
then, more than 25 drugs that target kinases (which help to control the
function of certain proteins) have been approved.

Protein structures solved by the Structural Genomics Consortium are made publicly available.

The DSTT, which was formed in 1998, is 
made up of AstraZeneca, Boehringer Ingel- 
heim, GlaxoSmithKline, Janssen Pharmaceu- 
tica, Merck KGaA, Pfizer and 20 academic 
research teams, and according to Cohen is 
probably the longest running collaboration 
of its kind. Under the terms of the agree- 
ment, all unpublished results are shared 
between the collaborators, along with rea- 
gents, technology and technical know-how. 
Faculty and students must sign confidenti- 
ality agreements regarding the companies’ 
intellectual property, although they can still 
publish papers based on the collaboration’s 
research. The first drafts of articles are shared 
on a private website. Any member that wants
a head start on development and patenting 
before the information reaches the public 
has 45 days to request a 9-month publication 
delay. Cohen says that the number of papers 
delayed is low — perhaps around 10 out of 
the past 400 — and that in practice, the delay 
is not as long as it seems because researchers 
tend to share drafts at earlier stages than they 
would submit them to journals. “We actually
think that the threat of this delay has caused 
us to publish more effectively and efficiently,” 
he says. 

By contrast, the public-private partner- 
ship the Structural Genomics Consortium 
(SGC) has no members-only viewing period. 
Its policy is to release data to its members 
and the rest of the world simultaneously, 
with no restrictions on use. The SGC also 
promises never to patent anything. “That 
openness does lead to faster science,” says 
Aled Edwards, a protein biochemist at the 
University of Toronto, Canada, and founder 
of the SGC. The SGC determines a protein’s 
structure and publishes the information in 
the international repository the Protein Data 
Bank. Under the policies of many scientific 
journals, anyone who describes a protein 
structure must deposit their data in the bank 
to make it available to all researchers. In the 
past 12 years, the SGC has deposited more 
than 1,500 descriptions of protein structures, 
from both humans and parasites, into the 
data bank. It also develops antibodies and 
chemical probes — small molecules that test 
how a potential drug interacts with biological 
targets (see page S60). 

SGC members all have different strengths. 
Academic researchers, Edwards says, are 
good at making basic discoveries, but have 
no incentive to take them beyond published 
papers. Participating pharmaceutical com- 
panies, however, are much more focused 
on creating marketable therapies. They are 
very good at high-throughput screening 
of drug candidates, but don’t spend much 
time on the most basic science. “We do get 
knowledge from the academic groups, but 
we also provide knowledge about how to 
develop assays,” says Cottens — Novartis is 
an SGC partner. 
UNDER PRESSURE


The pharmaceutical industry estimates that it is now more expensive to bring a drug to market than ever. Research and development (R&D) investment by members of
the trade group the Pharmaceutical Research and Manufacturers of America was more than US$50 billion in 2010. Pressure from generics (identical or equivalent
versions of drugs for which patent protection has expired) that are typically substantially cheaper than branded options is also increasing. But the number of drugs
coming to market each year is rising: 2014 and 2015 saw the highest number of US Food and Drug Administration approvals in the past ten years.

[Chart: total R&D spend (US$, billions), 1980-2010. R&D investment in 2010 was 25 times that of 1980.]


Unlike the DSTT and the SGC, the IMI has 
no general rules about intellectual property. 
Instead, Meulien says, details of what can be 
shared and what stays proprietary are agreed 
in advance between collaborators on a given
project. “We have a whole spectrum of types
of arrangements.” 

Between 2014 and 2024, the IMI will 
receive €1.6 billion ($1.82 billion) from the 
European Commission and €1.4 billion from 
European pharmaceutical companies. This 
will fund projects that focus on neurological 
conditions, diabetes, cancer, tuberculosis, 
obesity, vaccine safety, the use of stem cells 
in drug discovery and antimicrobial resist- 
ance, among others. One IMI effort, the 
European Lead Factory, provides small- and 
medium-sized companies, as well as academ- 
ics, with free access to half a million chemi- 
cal compounds, which they can use to screen 
potential drug targets. 


NOT HITTING THE PRICE POINT 

Although a promising approach to point 
the way to therapies, partnerships that focus 
on fundamental science may have little 
impact on the overall cost of research. This 
is because it is during clinical trials, rather 
than early research, that most drug candi- 
dates fail. The SGC, for instance, is “at a fairly 
inexpensive part of the pharmaceutical dis- 
covery process,” Edwards says. 

So, in 2011, he and SGC chief scien- 
tist Chas Bountra, a translational medi- 
cine specialist at the University of Oxford, 
UK, joined with Sage Bionetworks, a non- 
profit biomedical research organization in 
Seattle, Washington, to form a partnership 
they called Archipelago to Proof of Clinical 
Mechanism (Arch2POCM). The idea was to 
extend pre-competitive cooperation on a few 
drug targets into phase II clinical trials, after 
which the risk of failure drops substantially. 
But the vision proved too ambitious, says chemist Thea Norman, Sage’s
director of strategic development. She says that the pharmaceutical
companies worried that it might prove too difficult to base intellectual
property on compounds and information that would be in the public
domain. “The idea was a new one and one that maybe at first glance for a
pharmaceutical company takes a little explaining,” she says. “We had at
least two pharmaceutical companies that were ready to sign up, but we
felt we needed a little more critical mass than that.” To get enough
funding for what they had in mind, she says, they needed three to five
companies on board.

When the first approach turned out to be 
more than the industry was willing to sign 
up to, Arch2POCM’s founders launched a 
smaller-scale project, but one that still went 
beyond previous collaborations. The group 
began a 3-year UK effort in late 2013 with 
the Institute of Cancer Research and New- 
castle University, with funding from Can- 
cer Research UK and the Avon Foundation 
for Women, but with no pharmaceutical 
companies involved. The scaled-back pro- 
gramme aims to find a candidate compound 
that works on the enzyme KDM4B, which is 
implicated in breast cancer, but
won’t take the compound all the way through
phase II trials. Norman says that the hope 
is that, whatever the scientific outcome, the 
project will demonstrate that cooperation 
can benefit drug development beyond the 
earliest stages of discovery. 

Another way to maintain the openness 
between pharmaceutical companies further
into the drug-development process
may be to change financial incentives. Liza
Vertinsky, who focuses on intellectual prop- 
erty at Emory University in Atlanta, Geor- 
gia, says that current patent law encourages 
companies to jealously guard their intel- 
lectual property, because if they lose patent 
protection they could lose out on the prof- 
its that come with a successful drug. “The 
patent system has not been designed with 
open collaboration in mind, so the mecha- 
nism of how you would share intellectual 
property is not built into the system,” she 
says. An alternative would be for lawmakers 
and courts to develop a concept of fair use, 
analogous to laws that allow people to quote 
a passage from a novel or sample a snippet 
of a song while not violating copyright, for 
example. That way, she argues, companies 
could share some portion of their research 
findings without giving up all claims to their 
intellectual property. Vertinsky intends to 
look more closely at pre-competitive part- 
nerships in the coming year to better under- 
stand how changes in the law might affect 
the way they work. 

Even if the cooperation between 
pharmaceutical companies cannot be 
expanded further, pre-competitive partner- 
ships are still having a positive effect on drug 
discovery, participants say. Although Cottens 
won't go into specifics about proprietary work,
he says that the collaboration “has clearly 
accelerated” the projects that Novartis is 
working on. Meulien says that these efforts 
are already helping to translate academic 
knowledge into practical applications. “We 
hope to really transform the ecosystem for 
how these things are done in Europe,” he
says. “We do things that no one company or
university could do alone.”


Neil Savage is a freelance writer based in 
Lowell, Massachusetts. 
FASTERCURES, A CENTER OF THE MILKEN INSTITUTE


Q&A Bernard Munos 
Change big pharma 


In 2006, pharmaceutical innovation consultant Bernard Munos helped to launch a lively public
discussion about how open innovation can bring novel drugs to market with his paper ‘Can
open-source R&D reinvigorate drug research?’ He tells Nature how things have changed since then.


How has the research landscape changed over 
the past decade? 

Until ten years ago, innovation was 
controlled by the big pharmaceutical firms, 
because they were the only organizations able 
to afford it. Today, there are all kinds of new 
players and research models that have spread 
innovation to all parts of the ecosystem. 

Big pharma used to be inward-looking 
about their research pipelines; overwhelm- 
ingly, their ideas came from inside the com- 
pany. But the number of drugs approved was 
too low to secure the future of the industry. 
Companies realized that they needed more, 
better and cheaper innovation. Now, there is 
a lot of openness. Some firms have adjusted 
their research model to effectively tap the 
global brain. The most successful are willing 
to contemplate totally new science. 


What differentiates the companies that do so? 

It comes down to leadership. Take Paul 
Stoffels, chief scientific officer of pharmaceu- 
tical company Johnson & Johnson (J&J). He 
championed the idea of Janssen Labs, an out- 
reach organization that is run alongside J&J’s 
research. Janssen Labs reaches out to those 
who do breakthrough science, and lets them 
know that they can avail themselves of the 


scientific, and potentially financial, support 
that J&J can provide. It hosts these partners 
in innovation incubators, which nurture doz- 
ens of companies with the potential to come 
up with game-changing therapies. The drug 
candidates born of these projects have not yet 
percolated through the J&J pipeline, but when
they do, the company’s drug output could be 
multiplied several-fold. 


How else is drug innovation changing? 

You've got all kinds of groups doing viable 
research now. Universities are tired of see- 
ing their great ideas languishing on the shelf 
because they fall outside the comfort zone of 
big pharma. They want to give those ideas a 
fair chance, so they are engaging in transla- 
tional research themselves. 

Patient advocacy organizations are taking 
things into their own hands and launch- 
ing credible drug research and development 
(R&D) programmes that run on a shoestring.

Venture philanthropists, who for personal 
reasons may have an interest in certain dis- 
eases, are willing to part with large amounts 
of money to fund initiatives to help with 
those diseases. All of this not only enlarges 
the global pipeline, but also changes the eco- 
nomics of drug R&D. 
So, is the process now less dependent on the
financial power of the big companies? 

The drug-discovery process has become a lot 
more open. You can be a one-person pharma- 
ceutical company today. You have access to 
many of the same databases and resources as 
your colleagues in the pharmaceutical industry. 
You can also use crowdsourcing platforms to get 
the input you need at a very attractive price. I 
know one company that went from creation 
through to phase I clinical trials with only 
6 people and about US$70 million in cumula- 
tive funding. That is a puny amount compared 
with what it used to cost. 


How will the move towards patient-gathered 
data disrupt the industry? 

Biosensing technologies allow patients to be 
monitored at home, generating vast amounts of 
high-quality data at very low cost. The impact 
of these technologies on clinical research will be 
huge, and the speed at which it is happening is 
astonishing. These technologies will change the 
industry. They will reduce the cost of research 
and erase much of the advantage that scale used 
to confer on big companies. 


What happens when patients take a more 
active role in the control of their clinical data? 
This will be game-changing. Whoever 
controls data collection and access will ulti- 
mately control innovation. Patients realize 
that they can play a central part there, and 
are organizing themselves accordingly. This 
is quite a threat for industry, because patients 
have different values to pharmaceutical com- 
panies. Patients are pushing their values on 
the clinical research enterprise, and they are 
values of openness, speed and convenience. 


And the value of affordability? 

Yes, the cost of innovation is still a big 
problem. Of the pharmaceutical needs in 
the United States, 90% are met by generic 
drugs costing $70 billion; the remaining 10% 
cost $350 billion. That should be concern- 
ing to pharmaceutical leaders. Their new 
drugs often treat only thousands of patients, 
whereas the old blockbusters treated mil- 
lions. And how can you grow a $350-billion 
revenue base if you have lost millions of 
customers? 


Do drug companies fully understand this crisis? 
Most do not produce enough innovation 
to grow. In fact, half of them are shrink- 
ing. They try to mitigate this by escalating 
prices, which is dangerous. I think industry 
is misjudging the anger that its practices are 
creating. The risk of a backlash is real. Policy- 
makers could quickly make changes that may 
be much less palatable to industry than what 
could have been achieved by self-policing.


INTERVIEW BY ERIC BENDER 


This interview has been edited for length and clarity. 
James Bradner (left), who worked at the Dana-Farber Cancer Institute in Boston, Massachusetts, with Jun Qi, who synthesized the JQ1 compound.


CHEMICAL PROBES 


A shared toolbox 


In a pioneering move, the compound JQ1 was released to the community for free. The
impact that this has had on research and development is slowly coming into focus. 


BY ANDREW R. SCOTT 


In 2010, a young New England firefighter lay dying from a rare and
aggressive form of cancer. The disease, NUT midline carcinoma, had taken
hold in his left lung and spread widely. A chest tube, inserted to offer
some comfort, continually drained fluid containing cancer cells. When
doctors explained the research potential of these cells, the man readily
agreed to donate them to scientists at the Brigham and Women’s Hospital
and at the Dana-Farber Cancer Institute, both in Boston, Massachusetts.
This gift was one of the key steps in a pioneering development in open
innovation that now serves as a model for drug researchers worldwide.

James Bradner’s group at Dana-Farber had a molecule that they hoped
might make these cells “forget they were cancer”, as Bradner puts
it. Sure enough, on exposure to this molecule 
the cells seemed to return to normal. And 
when the cancer cells were grown in mice — a 
model of the firefighter’s disease — and the 
animals were treated with the molecule, the 
tumours began to shrink (P. Filippakopoulos 
et al. Nature 468, 1067-1073; 2010). 

Sadly, this result could not save the fire- 
fighter, but Bradner’s decision to broadly 
release the compound through an open- 
source model of drug discovery has spawned 
a raft of patents and clinical trials of a new 
class of drugs: bromodomain inhibitors. 

The molecule that affected the firefighter’s 
cancer cells so dramatically was JQ1, named 
after the chemist who made the original compound,
Jun Qi. JQ1 was not intended to be a
drug; limitations in its solubility and half-life
mean that it probably never will be. Rather,
JQ1 is a chemical probe — a small molecule 
that interacts with a target protein to allow 
the activity of that protein to be investigated 
in the cell. 

JQ1 binds to a pocket-shaped region 
found in many proteins, called a bromodo- 
main. This structure detects a small molecu- 
lar signal attached to the histone proteins 
that parcel up DNA into chromatin. When 
the signal, which Bradner describes as “a 
molecular post-it note”, is recognized by the 
bromodomain, it can stimulate the protein to 
begin a series of interactions that results in 
the activation of genes involved in sustaining 
cell growth and activity. But in some diseases, 
including cancer, cells interpret this signal as 
an instruction to grow and divide out of control.
The development of compounds to stop
this errant bromodomain signalling could be
a focus for therapy, and, as Bradner saw it,
JQ1 had the potential to kick-start the process.


SAM OGDEN/DANA-FARBER CANCER INSTITUTE


SOURCE: Z. ARSHAD ET AL. EXPERT OPIN. DRUG DISCOV. 11, 321-332 (2016).


OUT IN THE OPEN 

When JQ1’s effect on the firefighter’s cancer 
cells was first discovered, the conventional 
approach would have been to keep every- 
thing secret, explains Bradner, who took up 
the role of president of Novartis Institutes for 
BioMedical Research, based in Cambridge, 
Massachusetts, in March. Secrecy was the 
standard approach — all the details were kept 
hidden until either the prototype drug had 
been turned into an active pharmaceutical 
compound or efforts had come to a standstill. 
“But we did just the opposite,” says Bradner. In 
what he describes as a “social experiment”, he
and his laboratory released all the information 
they had on JQ1. Not only that, but they also 
promised to supply unlimited quantities of JQ1 
to any researcher who wanted it, for free and 
without restriction on use. 

Bradner believes that this approach has 
greatly accelerated drug development in the 
field of bromodomain inhibition. His lab has 
sent samples of JQ1 to more than 400 laborato- 
ries worldwide, both in academia and industry, 
and there has been a clear increase in research 
activity around the bromodomain proteins it 
targets, leading to more than 100 filed patents 
(see ‘Driving innovation’). These patents are 
not for JQ1 itself, but for other molecules that
target bromodomains — the development of 
many of these was guided by the use of JQ1 
as a research tool. Some of the structures of 
these drug candidates are similar to JQ1, but 
others are completely unrelated. Bradner is 
convinced that, had he not adopted the open 
approach, there might be only one, or at most 
two, bromodomain inhibitors in clinical tri- 
als, including his own, TEN-010. Instead, he 
knows of eight agents under development for 
the treatment of cancer alone. 

“Normally the process of obtaining agreement to use these valuable
reagents can be long and complex,” says physician-scientist Ross Levine
at the Memorial Sloan Kettering Cancer Center in New York. Being able to
obtain probes such as JQ1 through open-access agreements has, he says,
greatly assisted his research.

Researchers who attempted to assess 
the impact of JQ1 came to a similar verdict 
(Z. Arshad et al. Expert Opin. Drug Discov. 
11, 321-332; 2016). They found that the open 
release of JQ1 increased innovation in the 
broader field of bromodomain inhibition — evi- 
dence, they said, that an open-access approach 
can improve the efficiency and lower the cost 
of both drug discovery and commercialization. 


THE WIDER VIEW 

Clinical trials are now under way on the use of bromodomain inhibitors
to treat some of the most common cancers, including lymphomas,
leukaemias, multiple myelomas and solid tumours (such as cancers of the
pancreas and prostate). Development of all these molecules has benefited
from data gleaned from the availability of the molecular probe.


DRIVING INNOVATION


More patents have been filed related to
bromodomains, the target for the JQ1 chemical
probe, than for similar drug-discovery targets for
which the probes were not made openly available.

And cancer is not the only target. The open 
release of JQ1 is also proving to be a boon for
research into various neurodegenerative dis- 
orders such as Alzheimer’s disease, as well as 
inflammation and viral infection. 

JQ1 is just one molecule, however. How 
much of a trendsetter it will prove to be 
remains to be seen. Structural biologist Wen 
Hwa Lee at the University of Oxford, UK, says 
that although the progress made with bromodomains is a sign of the value
of open access, the wider impact of open innovation remains limited.
“It is such a new concept,” he says. “Having been told all their careers
that they have to protect their data, many scientists can find it hard
to get their heads around the idea of openly sharing.”

Still, developers of other chemical probes are aiming to replicate the
JQ1 success story. As part of the drive to increase the ease of access
to many more probes, an alliance of researchers, with the endorsement of
three life-sciences research organizations (the Structural Genomics
Consortium, to which Lee belongs; the Institute of Cancer Research in
London; and the Broad Institute of Harvard and Massachusetts Institute
of Technology in Cambridge, Massachusetts), has been leading an effort
funded by the UK’s Wellcome Trust to collate data on chemical probes and
allow researchers to obtain supplies of these molecules. In late 2015,
the group launched
chemicalprobes.org, an online portal to freely 
share probe data. Anyone can submit a probe 
for publication on the site, although each sub- 
mission is subjected to peer review — designed 
to address past problems with poor-quality 
probes that generated misleading research 
results. As of March 2016, the portal hosted 
more than 100 probes targeted against 10 key 
families of proteins, including bromodomains. 
Many of these probes are already being used to 
guide researchers towards new drugs for clini- 
cal trials. 


COMMERCIAL INTEREST 

Sharing chemical probes doesn't just appeal to 
academics — the practice is catching the atten- 
tion of drug companies too. Mark Bunnage, 
a medicinal chemist at biopharmaceutical 
company Pfizer, believes that JQ1 has had “a 
profound impact” on basic research that has 
allowed the search for new drugs. As well as 
making use of the probe itself, Pfizer has also 
begun to release more information about its 
own molecular tools. Many of the company’s 
chemical probes are available through vendors 
such as life-sciences company Sigma Aldrich. 
Pfizer also collaborates with the Structural 
Genomics Consortium to create new probes, 
a number of which have been included on the 
chemical probes portal. “This is a big change 
of mindset,” says Bunnage. “Open-innovation 
approaches are definitely helping companies 
and academics to work together to advance 
the discovery and development of medicines.” 
He hopes that the increase in collaboration 
in the earliest stages may reduce the number 
of prospective drugs that fail during phase I 
clinical trials. 

The key to collaboration between academia 
and life-sciences companies is the sharing of 
chemical probes so that a protein or other 
biomolecule can be identified as a drug target 
early. This accelerates the basic science and 
allows market dynamics to kick in to incentiv- 
ize the development of specific drugs. “Every- 
one can eventually try to develop the best drug 
to interact with each target,” says Bunnage. Lee 
agrees that for open-research platforms such as 
the chemical probes portal to be part of a successful 
economic model, there comes a point 
at which barriers of confidentiality must come 
down. At some stage drug companies, driven 
by their requirement to make profits, must be 
free to develop their candidate drugs confiden- 
tially. Negotiating the transition from open to 
closed innovation may be a difficult process 
with high stakes. “Where the pre-competitive 
line lies must be explored by the partners 
involved and may also depend on the disease 
area and its needs,” says Lee (see page S56). 

The path that Bradner’s career has taken 
since his decision to release JQ1 into the 
wild demonstrates that openness and com- 
mercial interests are not mutually exclusive. 
Tensha Therapeutics, a start-up he founded 
in 2011 to develop drug-like bromodomain 
inhibitors, was acquired by pharmaceutical 
company Roche in early 2016 in a deal worth 
around US$500 million. Clearly, acting in the 
interests of open science need not preclude 
commercial success. 


Andrew R. Scott is a science writer based in 
Perth, UK. 
CHALLENGES 


Crowdsourced solutions 


Open competitions bring new minds, skills and 
collaborations to problems in biomedical research. 


BY ERIC BENDER 
Hours before the close of Kaggle’s competition 
to find out why almost one-third 
of women in the United States are not 
screened for cervical cancer, the leading team 
has submitted the 115th iteration of its model. 
Forty groups around the world are competing 
to win US$100,000 in a challenge sponsored by 
biotechnology company Genentech. 

The models are based on analyses of a 
150 gigabyte database of de-identified patient 
data, says computational biologist Wendy Kan, 
who set up the challenge and works at Kaggle 
in San Francisco, California, a company that 
runs predictive modelling and analytics com- 
petitions that allow data scientists to compete 
to solve complex problems. In addition to find- 
ing solutions, contestants are asked to explain 
their reasoning. “It’s very important for us to tell 
a story,” Kan says. Later, on a Kaggle forum, a 
member of the winning team presents two of the 
group’s hypotheses: multiple chronic diseases 
and mental-health issues are major factors in 
why some women skip screening. 

Another Kaggle challenge, which began in 
December, asked participants to transform the 
diagnosis of heart disease by coming up with 
an algorithm to examine cardiac magnetic 
resonance imaging (MRI) scans to see how 
well the heart is pumping blood — “A very 
difficult problem,” Kan says. Entrants used 
a cardiac MRI data set provided by the US 
National Heart, Lung and Blood Institute, and 
192 teams were in the running for the $200,000 
prize when the competition closed. The vic- 
tors were two quantitative analysts who have 
worked with hedge funds, but had no experi- 
ence in cardiology. 

So far, more than 450,000 data scientists 
have tried their hand at Kaggle’s predictive- 
modelling puzzles, says economist Anthony 
Goldbloom, founder and chief executive of the 
organization. The problems — many pertaining 
to health, but others in fields that range from 
criminology to search technology — are set 
up so that the background of entrants doesn’t 
matter, he says. As long as they have suitable 
modelling skills, no particular experience or 
qualifications are needed. 

“They are all smart, highly motivated and 
incredibly capable,” adds Goldbloom. “The 
winning margin is usually very small; often the 
difference between first and second isn’t even 
statistically significant.” 

Kaggle is one of a number of organizations 
running open global challenges in life sciences 
to address knotty problems in basic biology, 
clinical research or health care. The approach is 
steadily gaining backers in academic laboratories 
and classrooms, drug companies and government 
agencies as a way to bring well-defined 
but thorny problems to the attention of brilliant 
minds around the world. 

The design of the competitions varies from 
challenge to challenge and host to host. Some 
ask for modelling algorithms, others for ideas, 
and still more for prototype medical solutions. 
Prizes are often offered, although participants 
usually insist that money is not the main moti- 
vation. Some of the winning solutions, espe- 
cially those sponsored by industry, remain 
secret, but others are made openly available 
and a few have already resulted in advances in 
clinical research. 


CLOCKWORK ORIGINS 

Competitions in science and engineering have 
a long history. In 1714, the Longitude Act saw 
the UK government offer a reward of £20,000 
(well over £2,000,000 ($2,865,400) in today’s 
money) for a solution to the problem of cal- 
culating longitude at sea. Not just one, but 
two answers emerged: the marine chronom- 
eter, developed by clockmaker John Harrison, 
which kept time at sea well enough for navi- 
gators to calculate longitude effectively; and a 
method for devising longitude from the motion 
of the Moon born of a combined effort by scientists, 
including mathematician John Hadley 
and astronomer Tobias Mayer. 

But it took the advent of the Internet for 
crowdsourced medical contests to really take 
off, notably with the Critical Assessment of 
Protein Structure Prediction (CASP) experi- 
ments, which have seen research groups test 
their methods for predicting 3D protein struc- 
tures against those of their peers since 1994. 

The competitions gained more industry 
backing as pharmaceutical companies began to 
struggle with their pipelines. The crowdsourc- 
ing firm InnoCentive, for example, formed in 
2001, a time when “the pharmaceutical industry 
needed to rethink its business model”, 
recalls Alph Bingham, co-founder of the company 
based in Waltham, Massachusetts, and 
then a vice president at pharmaceutical giant 
Eli Lilly. “The Internet let you access minds on 
a scale and a scope that had never been possible 
before.” 

Spun out from Eli Lilly, InnoCentive has held 
more than 2,000 open challenges and attracted 
more than 375,000 ‘solvers’. The continual 
string of challenges can be tightly focused and 
relatively small, such as a $30,000 challenge to 
find a minimally invasive skin-biopsy method 
to measure gene expression, or attempt to tackle 
larger problems, such as a major $500,000 chal- 
lenge sponsored by the US National Institutes 
of Health (NIH) to look for robust methods 
to examine individual cells. Proposals such as 
these are inherently risky and might not survive 
the conventional NIH grant process. 

Indeed, challenges seem to hold a number 
of advantages over conventional research prac- 
tices. One of the leading crowdsourcing initia- 
tives is the Dialogue for Reverse Engineering 
Assessments and Methods (DREAM) Challenges 
programme, which sees groups compete 
in open competitions to solve complex 
modelling problems in systems biology, says 
Gustavo Stolovitzky, co-founder of the project 
and computational biologist at IBM’s Thomas J. 
Watson Research Center in Yorktown Heights, 
New York. 

When dozens of teams around the world take 
on a DREAM project, they often accomplish 
in months what would take a single research 
group years, “since you can multiply the num- 
ber of people working on the problem by 50 or 
100,” says Stolovitzky. Many challenges also 
bring in researchers from other fields, who may 
approach problems in ways that those closely 
acquainted with them would not. 

Just as crucially, challenges jump-start col- 
laborative communities. For instance, the 
ICGC-TCGA DREAM Somatic Mutation Call- 
ing Meta-pipeline Challenge is a collaboration 
between DREAM, the International Cancer 
Genome Consortium, The Cancer Genome 
Atlas and biomedical research organization 
Sage Bionetworks in Seattle, Washington. Its 
aim is to improve standard methods for iden- 
tifying cancer-associated mutations and rear- 
rangements in whole-genome sequencing. In 
the process, they are building an ongoing com- 
munity in which researchers can find the best 
and latest algorithms, rather than having to go 
to scientific journals. 

Crowdsourced tournaments can also open 
up access to data — either those aggregated 
specifically for the purpose, such as Kag- 
gle’s cervical-cancer and cardiac MRI data- 
bases, or data sets that would otherwise lie 
dormant. “There are too many data silos in 
which researchers hoard their data, sometimes 
for years,” Stolovitzky says. “Ultimately, 
everybody should be able to look at that data 
with information about how the data was 
gathered, allowing collaboration and data 
sharing in a positive and meaningful way.” 

In addition, contests can lower the legal bar- 
riers that plague collaborations between insti- 
tutions or companies, says Bingham. “They 
offer ways to engage all these different people 
without having to precede that whole process 
with 200 days of legal briefs being exchanged 
between institutions,” he says. 

For these contests to achieve these positive 
impacts, however, they have to be well man- 
aged. Crowdsourcing is of little help in areas 
in which research is at such an early stage that 
the organizers can’t ask the right questions. 
For any challenge to work, the problem must 
be well defined and able to be judged fairly, says 
systems biologist Stephen Friend, co-founder 
and director of Sage Bionetworks. It’s also 
important for an impartial expert in the field 
to act as a convener and nurture the emerging 
community, he says. 

Non-profit foundations — increasingly 
important providers of research funding — 
are also making use of crowdsourcing. Often 
these focus on diseases that drug companies 
rarely target (see page S68). One example is 
Prize4Life in Berkeley, California, founded in 
2006 when Harvard business school graduate 
Avichai Kremer was diagnosed with amyotrophic 
lateral sclerosis (ALS; also known as 
motor neuron disease), and best known for its 
$1-million contests. 

“Prizes can really bring a new population 
of researchers into the field,” says neurosci- 
entist Neta Zach, chief scientific officer at 
Prize4Life. “And a lot of them continue to 
work on ALS.” Prize4Life’s first major chal- 
lenge addressed the lack of useful biomarkers 
for ALS progression. “We expected that the 
tool would be based on measurements from 
blood or cerebral spinal fluid,” Zach says. 
Instead, the winning tool in 2011 was a more 
creative solution: a pain-free non-invasive 
medical device that measures the flow of 
electrical current through muscle tissue. The 
winnings helped to build the San Francisco 
start-up Skulpt, which is testing such devices 
in ALS trials (as well as offering them to con- 
sumers as fitness tools). 

The foundation also partnered with 
DREAM and InnoCentive in a $50,000 chal- 
lenge to predict the progression of ALS. When 
the predictions of the winning algorithm were 
compared with those made by ALS clinicians 
in the assessment of 14 people with ALS 
(R. Küffner et al. Nature Biotechnol. 33, 51–57; 
2015), “the algorithm outperformed each and 
every one of the clinicians on each and every 
one of the patients”, Zach says. The model is 
now used to make ALS clinical trials more 
efficient and their results clearer — a better 
understanding of ALS makes it easier to assess 
the benefits of treatment. 

DREAM was launched in 2006 by 
Stolovitzky and systems biologist Andrea 
Califano at Columbia University in New York 
City to improve the state of the art in systems- 
biology modelling. As well as solving problems, 
DREAM challenges validate the solutions. 

Sometimes when data-science groups tackle 
a difficult problem, they can convince them- 
selves that they have produced a good solution, 
rather than actually solving it well. Stolovitzky 
calls this the “self-assessment trap”, which can 
lead to mistakes such as overfitting models to 
one set of data. But if 50 DREAM teams are 
involved, “we can see if we can really find a clear 
signal in the data”, he says. 
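
The trap can be illustrated with a small Python sketch. It is not taken from any DREAM challenge: the synthetic data, the 20% label noise and the nearest-neighbour "memorizer" are all assumptions chosen for illustration. A model that is scored on the data it was built from looks perfect, whereas the same model scored on data it has never seen does noticeably worse, which is the gap that blind, third-party evaluation is meant to expose.

```python
import random


def make_data(n, rng, noise=0.2):
    """Points x in [0, 1); the true label is x > 0.5, flipped with probability `noise`."""
    data = []
    for _ in range(n):
        x = rng.random()
        label = x > 0.5
        if rng.random() < noise:
            label = not label  # simulated measurement noise (an assumption)
        data.append((x, label))
    return data


def memorizer_predict(train, x):
    """1-nearest-neighbour: return the label of the closest training point."""
    nearest = min(train, key=lambda point: abs(point[0] - x))
    return nearest[1]


def accuracy(train, data):
    """Fraction of `data` labelled correctly by the model built from `train`."""
    hits = sum(memorizer_predict(train, x) == y for x, y in data)
    return hits / len(data)


rng = random.Random(0)
train = make_data(200, rng)
held_out = make_data(200, rng)

self_score = accuracy(train, train)      # the team grades its own work
blind_score = accuracy(train, held_out)  # an organizer grades on unseen data

print(f"self-assessed accuracy: {self_score:.2f}")
print(f"held-out accuracy:      {blind_score:.2f}")
```

Because the memorizer has simply stored the noisy training labels, its self-assessed score says nothing about how well it generalizes; only the held-out score does.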

In 2012, DREAM joined forces with Sage 
Bionetworks, which had created Synapse, a 
pioneering open-computing platform for data 
analysis and sharing. The first joint challenge 
generated models to classify the aggressive- 
ness of breast cancer. The models clearly per- 
formed better than today’s commercial tests, 
says Friend. “More importantly, the challenge 
showed that people who had not generated the 
data were able to get deep insights,” he says. 
“And the electrical engineer who won had very 
little chemical background.” 


RISING TO THE CHALLENGE 

Competitions are beginning to exploit the 
opportunities provided by data contributed 
directly by patients. Sage, for example, created 
mPower, an app that uses iPhone sensors to 
measure symptoms of Parkinson's disease pro- 
gression such as dexterity or gait. And Sage has 
partnered with other groups, such as Oregon 
Health and Science University in Portland and 
Harvard University in Cambridge, Massachu- 
setts, to create numerous such apps, which 
can very quickly provide high-quality data. 
“We have over 200,000 people who have said, 
I want to share my data with qualified users,” 
Friend says. 

In November 2015, a 
DREAM hackathon drew participants for two 
evenings of pizza, beer and the opportunity to 
begin interpreting data from tens of thousands 
of mPower users. That event reflects another 
trend in crowdsourcing — the rapid spread of 
biomedical hackathons. These are designed to 
bring experts from different disciplines face to 
face. The Hacking Medicine initiative at the 
Massachusetts Institute of Technology (MIT) 
in Cambridge, for instance, has so far hosted 
almost 50 such events, teaming up engineers 
and data scientists with clinicians in 1- or 2-day 
events that are meant to quickly and iteratively 
work towards initial solutions to a host of 
health-care problems. 

Among early results is an infant-resuscitation 
device for use in developing countries. The 
Ugandan paediatrician who first presented the 
problem has now taken the device into clini- 
cal trials in his country. The MIT initiative has 
helped to spark similar gatherings in places 
such as India and Uganda, led by the Consortium 
for Affordable Medical Technologies at 
Massachusetts General Hospital in Boston. 

Bringing researchers with varied expertise 
and skills together in one physical location can 
accelerate research. The BioMed X Innova- 
tion Center in Heidelberg, Germany, has gone 
further with what co-director, and biologist, 
Christian Tidona describes as an “outcubator”. 
Researchers compete not to come up with the 
best solution, but for the chance to try. 

BioMed X begins by posting a very specific 
problem from one of its sponsors online. This 
could be exploring a new drug target or an area 
of treatment new to the sponsor. These requests 
typically get 400–600 responses from around 
the world. BioMed X picks 15 of the most prom- 
ising concepts submitted and brings their crea- 
tors to Heidelberg, where they form teams for an 
intense 5-day competition. The winning group 
then tackles the problem in two- to four-year 
fellowships in Heidelberg. 

One of the first teams to go through the four- 
year exercise — made up of researchers from 
Germany, Slovenia and Egypt — created bio- 
informatics tools for designing highly selective 
inhibitors of kinases, proteins that play a part in 
many diseases. The sponsor, Merck, bought the 
intellectual-property rights and then licensed 
them back to the team, which formed a start-up 
company to develop the technology. 


RULES FOR THE FIGHT 

The benefits for research are clear, but what is 
it that drives participation in crowdsourced 
competitions? When a challenge is centred in 
a researcher's field, typically the greatest incen- 
tives to participate are the chance to publish 
a paper in a top journal and to network with 
peers, organizers say. 

But often the entrants are not the usual sus- 
pects. “They're also gadgeteers, basement inven- 
tors and weekend engineers,” says Bingham. “It’s 
not a bunch of French-literature majors that are 
solving our chemistry problems, but it might 
be physicists or intellectual-property attorneys 
or biologists.” Even in competitions with cash 
prizes, “at the end of the day, cash is often a 
scorecard, not a paycheck”, he says. Challenges 
would be “a silly way to make money”, says 
Goldbloom. The main draw for participants is 
what originally led him to found Kaggle — the 
desire for “access to interesting data sets and 
interesting problems”. 

For medical firms, the challenges often pro- 
vide a relatively quick and inexpensive way to 
solve tricky problems, Bingham says. At the 
same time, he points out, “in order to bring a 
product to market, they usually have to solve a 
thousand problems of equal complexity”. For all 
concerned, “the wisdom of crowds works beautifully 
in a great percentage of the cases”, says 
Stolovitzky. “We're seeing a lot more buy-in for 
these challenges. If you can multiply the number 
of people, you can accelerate the research.” 


Eric Bender is a science writer based in 
Newton, Massachusetts. 
COMPOUND SCREENING 


Fresh hunting ground 


In the search for novel therapies, drug developers have begun crowdsourcing molecules. 


BY ANNABEL MCGILVRAY 


Across the world, more than 
15,000 chemical compounds are 
concocted every day, most of them 
in university laboratories. Until recently, the 
majority ended up in deep freeze — or on the 
rubbish heap. 

“When I was a PhD student I made 
300 molecules,” says Matt Cooper, a chemist 
at the University of Queensland in Brisbane, 
Australia. “Three years’ work, completely 
new molecules, all weird, all handcrafted. I 
remember having each of them labelled and 
bottled, but basically they went in the bin.” 

More than 20 years later, Cooper is 
director of one of the world’s largest molec- 
ular-compound-screening programmes: the 
Community for Open Antimicrobial Drug 
Discovery (CO-ADD). Frustrated by the 
waste of novel molecules, any one of which 
might hold the key to life-saving treatment 
for antibiotic-resistant bacterial infection, 
malaria or tuberculosis, Cooper wants to 
put these under-appreciated creations to use. 

CO-ADD is collating and screening vast 
quantities of small molecules in an effort 
to help spur the development of powerful 
new antibiotics. Funded by the Univer- 
sity of Queensland and Britain’s Wellcome 
Trust, the programme has already received 
40,000 compounds for screening since 
its launch in early 2015. Of these, around 
26,000 have undergone high-throughput 
screening, an automated procedure that can 
test the biological activity of thousands of 
compounds a day and an established part of 
the early drug-discovery process. 

The compounds are tested for 
antimicrobial activity against the most 
dangerous hospital-acquired, antibi- 
otic-resistant infections Escherichia coli, 
Klebsiella pneumoniae, Acinetobacter 
baumannii, Pseudomonas aeruginosa and 
methicillin-resistant Staphylococcus aureus 
(MRSA), as well as the common causes of 
fungal infections Cryptococcus neoformans 
and Candida albicans. The programme is 
working with 88 groups from 26 countries 
and has a further 300,000 compounds on 
the way, including those from what will be 
the first nationwide antimicrobial screen- 
ing project to be coordinated by France’s 
National Centre for Scientific Research. 

It is an initiative born of necessity at a 
time when profit-driven antibiotic research 
is floundering, says Cooper. “We think one 
of the reasons the industry is not productive 
any more is because we have been looking in 
the wrong chemical space.” He thinks that 
new drugs, including antibiotics, are likely to 
come from outside the conventional chemi- 
cal structures held in the vast collections of 
pharmaceutical companies. 

To see whether the molecules created in 
academic settings are more fruitful than the 
commercial compound libraries, Cooper 
and his colleagues began to screen com- 
pounds put forward by an academic group 
for efficacy against the selected bacteria 
and fungal pathogens. Compared with the 
biological activity of commercial libraries, 
the academics’ compounds had a 20–30 
times higher hit rate. “Academics make 
compounds for a whole range of reasons,” 
says Cooper. “They are more eclectic and 
more diverse.” 

SEARCH HISTORY 

Compound-screening programmes are relatively recent creations, and cover a range of therapeutic areas. They also vary in their processes, goals and legal 
requirements. The data made available to Nature suggest that, the more open the terms of the agreement, the more willing are external researchers to take part. 

Initiative | Launch | Compounds screened | Assay focus | Intellectual property | Data 
Community for Open Antimicrobial Drug Discovery (CO-ADD) | 2015 | Around 26,000; 88 organizations submitting | Antimicrobial | Remains with submitter | Data posted online by CO-ADD after 18 months 
Eli Lilly: Open Innovation Drug Discovery (OIDD) | 2009 | 70,000; 345 organizations submitting | Endocrine, cardiovascular, neurological, oncological and tubercular | Remains with submitter | Data are shared with submitter immediately 
AstraZeneca: New Molecule Profiling | 2014 | Undisclosed | Cardiometabolic, oncological, neurological and respiratory | Remains with submitter | Data are shared with submitter after a year 
Leo Pharma: Open Innovation | 2015 | 150; 15 organizations submitting | Dermatological | Remains with submitter | Data are shared with submitter 
Merck: Open Compound Sourcing | 2011 | Undisclosed; 30 organizations submitting | Oncological, immuno-oncological and immunological | Transferred to Merck when the compound is accepted for screening | Data become the property of Merck and are not automatically shared 

CO-ADD is now leading the way in tapping 
this diverse pool of potential drug leads by 
making it as easy as possible for chemists to 
share their creations and uncover potentially 
valuable medical uses. “We make no claims on 
intellectual property, so all the results and all 
of the intellectual property rests with the pro- 
vider of the compounds. There are no strings 
attached at all,” says Cooper. “We have become 
the first group to take open access to the nth 
degree and really reach out to people all around 
the world — crowdsourcing our molecules.” 

In an era of low productivity in early drug 
discovery, medicinal chemists in the corpo- 
rate realm have reached a similar conclusion 
to Cooper — the lack of chemical diversity 
is affecting early drug development. In late 
2015, pharmaceutical giants AstraZeneca 
and Sanofi took the unusual step of exchang- 
ing 210,000 proprietary compounds to boost 
the variety of compounds in their collections. 
And, like CO-ADD, some big pharma are invit- 
ing external researchers to put forward their 
compounds to determine whether they may 
have some medical application. In a world in 
which knowledge is fiercely protected, the new 
approach is prompting ongoing adjustments 
by everyone involved. Academic chemists are 
becoming more open about their research, 
and companies are now more willing to share 
intellectual property to encourage researchers 
to participate. This open-innovation approach 
is young, and drug development is notori- 
ously slow — a decade at best in most cases — 
so it may be years before a blockbuster drug 
results from this process. But the platforms are 
showing promise, which is sparking excitement 
among the enthusiastic pharmaceutical execu- 
tives who are driving them forward. 


CHEMICAL DIVERSITY 

Screening external compounds is a ready- 
made source of innovation for big pharma and 
requires little investment, given that the com- 
panies already have the sophisticated screen- 
ing technology in place. “It gives us access to 
compounds that are really different to the types 
of compounds that we have in our collection,” 
says Garry Pairaudeau, head of external sciences 
at AstraZeneca in Cambridge, UK. “A lot of the 
academic chemistry that is going on is quite 
unusual and quite specialized and would be 
difficult for us to reproduce in our lab.” 

Merck KGaA, based in Darmstadt, Ger- 
many, Leo Pharma and Eli Lilly are also host- 
ing compound-screening platforms that are 
designed to engage external researchers (see 
‘Search history’). The corporate platforms 
ask academic scientists or those associated 
with biotechnology start-ups to first submit 
their compounds for computer analysis. The 
structure of the compound is not shared with 
the drug giants at this stage, ensuring that 
the original researchers’ intellectual property 
is protected. This analysis is carried out to 
determine whether the molecule meets the 
company’s threshold for further examina- 
tion. Compounds can be rejected because they 
are too reactive or unstable, or too similar to 
previously screened compounds. In the pro- 
grammes run by AstraZeneca and Leo, this 
initial analysis is performed by a third party as 
an added precaution to protect confidentiality. 
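
As a rough illustration of that first triage step, the Python sketch below rejects a submission if it is flagged as reactive or unstable, or if it overlaps too heavily with something already screened. None of the companies' actual criteria are public in this article, so everything here is hypothetical: the `Submission` fields, the representation of a compound as a set of structural-feature identifiers, the use of the Tanimoto coefficient as the similarity measure and the 0.8 cut-off are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Submission:
    """A hypothetical submitted compound, described only by computed properties."""
    name: str
    features: frozenset  # hypothetical structural-feature identifiers
    reactive: bool = False
    unstable: bool = False


def tanimoto(a, b):
    """Shared features divided by all distinct features (0 = disjoint, 1 = identical)."""
    union = len(a | b)
    return len(a & b) / union if union else 0.0


def triage(submission, screened, max_similarity=0.8):
    """Return (accepted, reason) for one submission; the threshold is an assumption."""
    if submission.reactive:
        return False, "too reactive"
    if submission.unstable:
        return False, "unstable"
    for previous in screened:
        if tanimoto(submission.features, previous) >= max_similarity:
            return False, "too similar to a screened compound"
    return True, "accepted"


# Feature sets of compounds that have already been through screening.
screened = [frozenset({1, 2, 3, 4, 5})]

novel = Submission("novel", frozenset({10, 11, 12}))
near_copy = Submission("near_copy", frozenset({1, 2, 3, 4, 5, 6}))
flagged = Submission("flagged", frozenset({20, 21}), reactive=True)

results = {s.name: triage(s, screened) for s in (novel, near_copy, flagged)}
for name, (accepted, reason) in results.items():
    print(f"{name}: {reason}")
```

Working from computed features rather than the drawn structure mirrors the point made above: a compound can be assessed against a threshold without its full structure being disclosed to the company.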

If compounds meet the various thresh- 
olds, the external researchers then provide a 
sample (typically 1–5 milligrams) for further 
investigation. It is at this point that the molec- 
ular structure is shared, and agreements that 
outline the obligations and expectations of 
the various parties, as well as details of where 
the intellectual property rights lie, are signed. 
With the exception of Merck, the intellectual 
property remains with the external researcher. 
The results of the screening are then either 
shared with or transferred to the external 
researcher — again, except for the Merck pro- 
gramme, which provides no biological informa- 
tion unless an agreement to collaborate further 
is signed. 

Each company then has a different procedure 
if it decides that it wants to develop the com- 
pound. Eli Lilly asks only for the right of first 
approach if a compound shows promise. But the 
company has no legal recourse if a researcher 
takes a molecule elsewhere following the 
screening. “We hope that when somebody 
sends a compound to us, that we have the first 
right of approach,” says Marta Piñeiro-Núñez, 
head of Lilly's Open Innovation Drug Discov- 
ery (OIDD) platform, based in Indianapolis, 
Indiana. “We hope that by working with them 
and engaging in the research with them that we 
become a preferred partner.” It’s a similar story 
for Leo and AstraZeneca. 

Merck’s approach is more proprietorial. 
The standard agreement, signed when a com- 
pound is accepted for screening as part of its 
Open Compound Sourcing initiative, sets out 
that the arrangement must remain confidential 
and gives the company an exclusive three-year 
licence to develop the compound. Merck pays 
€200 (US$228) for the privilege, with a promise 
to pay a further €5,000 if the company files a 
patent application that includes the compound 
within 5 years. 

In all cases, the agreements at this point can 
become complex and, unlike the simpler no- 
intellectual-property approach of CO-ADD, 
require significant legal insight. “These things 
can take a long time,” says Niclas Nilsson, head 
of open innovation at Leo Pharma in Ballerup, 
Denmark, “which only further highlights the 
need for speedier and less-difficult processes to 
explore new collaborations”. 

These legal hurdles may be one reason 
why the response to some of the corporate 
screening platforms has been relatively weak 
compared with the CO-ADD programme. 
Merck has signed only around 30 contracts 
to screen small groups of compounds since 
the initiative’s 2011 launch. In the seven years 
that the less prescriptive Lilly OIDD plat- 
form has been operating, the company has 
screened 70,000 molecules selected from 
400,000 submissions, and at the start of 2015 
had 345 groups participating. The screenings 
have been done at a consistent rate of about 
500 compounds per month, but according to 
Piñeiro-Núñez, the company has the capacity 
for thousands more. Meanwhile, Leo has 
screened just 150 compounds submitted by 
15 partners in the first 12 months of its Open 
Innovation programme — a long way from 
the roughly 26,000 that have been assayed by 
Cooper and his CO-ADD colleagues during 
that time. Cooper attributes the variation to 
concern about the motivation behind the pro- 
grammes. “People are probably more likely to 
engage with a not-for-profit,” he says. “They 
feel more comfortable that we're doing it for 
the right reasons.” 

Aware of the challenge of fostering 
participation, Nilsson set out to devise a 
screening programme that would be attractive 
to both small industry and academia. Nilsson 
says, “Our approach was based on interviews 
with biotech and university researchers to 
determine how low the bar would have to be 
set in order to persuade them to take part and 
share their creations”. He then negotiated with 
the company’s legal and business development 
arms to determine how barriers to submission 
could be reduced. 

Researcher feedback drove Nilsson’s team 
to further adapt the conditions of the pro- 
gramme. Initially, they arranged to share the 
data with participants, giving both parties 
rights of access and ownership. But it quickly 
became clear that this would be difficult for 
small biotechs to agree to — if they don’t own 
the data, they can't use them as leverage when 
negotiating with others. “So now we transfer 
the ownership of the data back to the external 
partner. It means that they can increase the 
value of their assets by using our services and 
then they can walk away if they want to. That 
is, of course, our risk,” says Nilsson. 

Although the numbers are small, Nilsson says 
that the fledgling programme has already gener- 
ated positive responses and the company is in 
the process of negotiating with one contributor 
on a future collaboration in relation to a treatment
for the skin condition psoriasis. “They
submitted compounds that are targeting new 
proteins that we didn’t even know existed. So, 
not only are they providing new chemistry, but 
they also open up a new science that we didn’t
know could develop,” Nilsson says. “That is
exactly what we’re looking for with the open
innovation: something new that we couldn’t
predict ourselves.”


PROMISING PROGRESS 

Ana Cristina Parra Rivera, a student of Doug Frantz at the University of Texas, works remotely for Eli Lilly.

It is too early to see the fruits of this open-innovation
approach in the marketplace. But
there are signs that this increased engagement
with researchers and their molecules is having
an effect. Doug Frantz, a chemist at the Uni- 
versity of Texas at San Antonio, has been con- 
tributing to the Eli Lilly compound-screening 
programme for five years. His laboratory 
has submitted around 500 compounds and 
recently became the first OIDD contribu- 
tor in the programme’s history to reach an 
agreement to progress a molecule to trials in 
animals. In the context of drug development, 
this is more than one-third of the way towards
clinical use. The progress of the potential
chronic-pain drug (an antagonist for a form of
a sodium channel called NaV 1.7) is a milestone
for him as well as for Lilly. “It’s just a fantastic
opportunity,” Frantz says, “to be able to explore
the biological activity of my molecules, rather
than just making them and sticking them in the
freezer and publishing a paper. I can take that
data and I can take the compounds and publish
or put it into a grant application if I want.”

Frantz and his students can keep tabs on 
the assays that their compounds are under- 
going and any results through a secure 
online account. Many of their compounds 
have shown activity in these assays, and 
there are a few particularly promising com- 
pounds that they are considering adapting to 
meet Lilly’s threshold for further collabora- 
tion. As well as the NaV 1.7 antagonist, they 
have a second contract with Lilly that relates 
to a possible treatment for schizophrenia (an 
antagonist of the nervous-system receptor 
mGluR2). This has involved one of Frantz’s 
students working over the Internet to create 
compound analogues in Lilly’s automated- 
synthesis lab. 

The University of Texas at San Antonio is 
one of 255 universities that are contributing 
compounds for screening with Lilly. Piñeiro-Núñez
says that, initially, “people would
get up on their seats and say that we were
crazy”, but now many in the academic community
as well as industry are embracing
external compound screening. The results 
of the screening of compounds produced by 
external researchers by Lilly have been com- 
parable with, and sometimes better than, the 
1-5% hit rate of internally produced mol- 
ecules. “Even if the molecules don’t advance, 
they introduce starting points and they contribute
to each project,” Piñeiro-Núñez says.
The screening platforms are also a way to 
save money — AstraZeneca has put the cost 
of the internal production of the 210,000 
compounds that it received from Sanofi last 
year at around $25 million. 

After about a year of screening, the CO- 
ADD project has uncovered 128 hitherto 
unknown potential antibiotics that have 
passed tests for cell toxicity. On the basis 
of these results, the group is already talking 
to health-care-research funders, including 
the US-based Bill & Melinda Gates Founda- 
tion and the Wellcome Trust, about extend- 
ing the programme. By the end of 2020, 
Cooper hopes to have screened 1 million 
compounds. “When we do these things collectively
and collegially, they’re of incredible
value to society,” he says. “This is like
a molecule bank — we want to use this to 
screen for new compounds against malaria, 
against tuberculosis, against other pathogens 
and threats to world health.” 

CO-ADD seems to be well on its way to 
sourcing those 1 million molecules, but 
it remains very early days for this kind of 
open-innovation approach. Already, there 
is new engagement between big pharma and 
academia, and more diversity in compound 
collections and novel research leads. But only 
time will tell if drug developers can capitalize 
on this early promise. With only one com- 
pound thought to have made it to animal tri- 
als, these programmes still have a long way to 
go if society is to feel their true effect.


Annabel McGilvray is a freelance writer 
based in Sydney, Australia.


OPEN INNOVATION


TROPICAL DISEASE 


A child in Tanzania is treated for malaria — a disease that could benefit from an open approach to medical research. 


A neglected cause 


A more open approach to combating tropical diseases may help to overcome a
pharmaceutical market failure. 


BY LUCAS LAURSEN 


He didn’t know it at the time, but when
chemist Matthew Todd posted a
request for help on The Synaptic Leap,
a website devoted to open-source biomedical 
research, he was sowing the seeds for a rivalry 
between an open initiative and a contract- 
research organization hired by the World Health 
Organization to reach the same goal. 

The aim of both projects, run in 2010, was to 
produce a safer, low-cost version of praziquan- 
tel, a treatment for the tropical parasitic infec- 
tion schistosomiasis. Up until that point, the 
treatment contained two enantiomers (mirror- 
image versions of the molecule that have slightly 
different properties) of praziquantel. One enan- 
tiomer has no effect on the parasite, but gives the 
drug a bitter taste. Eliminating this undesirable 
form could reduce side effects and help more
patients to complete their treatment. The pure
drug needed to be affordable. Todd, who is at
the University of Sydney in Australia, thought
that an open project was the best way to achieve 
this. “Open is very well-suited for neglected 
diseases,” he says. “The pay-off of secrecy is not 
very large.”

This is because the potential buyers of 
drugs targeted at rare and neglected diseases, 
such as schistosomiasis and dengue fever, are 
often unable to pay the prices that large phar- 
maceutical companies must charge if their 
investments are to pay off. Occasionally, these 
companies have made investments that have 
no hope of direct commercial return, such as 
the decision in 1987 by Merck and Co., based 
in Kenilworth, New Jersey, to offer Mectizan 
(ivermectin), a drug for onchocerciasis or river 
blindness, free of charge. But too many of these 
decisions would risk a company’s survival. 
Although governments have often intervened 
to promote and fund research in areas with the 
highest financial risk, diseases that affect the 
poorest countries or smallest subpopulations
remain undertreated and under-researched.

The race to produce a single-enantiomer ver- 
sion of praziquantel finished in late 2010 in an 
effective tie when both Todd’s open group and 
the contract-research organization posted their 
different but comparable solutions online. Half 
a decade later, Todd is aware that his method 
is still being used to produce the active enan- 
tiomer of praziquantel in various labs around 
the world. Todd points to the work as one of the 
success stories of open-source medical research. 
The project matched the results of the conven- 
tional approach, but, by collaborating with 
researchers previously unknown to Todd’s team, 
he says that the project did it faster than it ever 
could have done in secret. 

The research community is embracing the 
open approach to find treatments for diseases 
that have little potential to make a profit. By col- 
laborating, drug companies, non-governmental 
organizations and governments can share the 
load. Partnerships to handle development and
distribution of treatments for neglected and
tropical diseases have already begun, as have 
efforts to find ways to share data on early-stage 
research to avoid duplication. And although 
open-source projects, such as Todd’s, remain 
the exception rather than the rule, they, too, are
gaining popularity.


FASTER SCIENCE


Around the turn of the millennium, a num- 
ber of organizations were set up to connect 
industry, government and non-governmental 
organizations, and to promote the sharing 
of knowledge about tropical and neglected 
diseases — Medicines for Malaria Venture 
(MMV), the TB Alliance and the GAVI Alli- 
ance. Pharmaceutical companies, they argued, 
had compound libraries and dormant intellectual
property that, for areas in which they
could not expect to reap significant profits,
could be shared without threatening their
financial prospects.

In 2010, pharmaceutical companies 
GlaxoSmithKline and Novartis, and St. Jude 
Children’s Hospital in Memphis, Tennessee, 
publicly released a database, through MMV,
that contained 20,000 compounds that had 
shown some antimalarial activity. These ‘hits’ 
are an early step on the long path to design- 
ing new drugs (see page S65) and are usually 
proprietary. Researchers began to also ask for 
samples that they could work with, so, the fol- 
lowing year, MMV began offering a selection 
of the compounds by post. This ‘malaria box’ 
offered researchers a common starting point 
and the organizers hoped that it would acceler- 
ate research. 

The box meant that researchers did not waste 
valuable time creating duplicates of compounds 
already produced by someone else. “People 
could really focus on understanding novel 
chemical series,” says Jeremy Burrows, head
of drug development at MMV based in Geneva,
Switzerland. And thanks to MMV’s coordination,
researchers can divide up the analysis to understand
the existing pool of compounds, rather than
work in secret and risk overlapping efforts.

Companies have also begun to share their 
intellectual property through large international 
systems, such as the Re:Search project, estab- 
lished in 2011 by the World Intellectual Prop- 
erty Organization (WIPO) to drive innovation 
around neglected diseases, malaria and tuber- 
culosis. Data and compounds, for example, are 
made available through research agreements 
that specify certain requirements, such as assur- 
ance that access to any end product is afford- 
able in the 49 least-developed countries. As of 
March 2016, the programme had 103 members 
operating under 100 research agreements. Five 
active antimalarial projects involve compounds
in pre-clinical development — one step before 
human clinical trials — and a WIPO-initiated 
dengue-fever project is already putting a repur- 
posed compound through clinical trials. 


OPEN REVOLUTION 

When business specialist Henry Chesbrough 
of the University of California, Berkeley, popu- 
larized the term open innovation in 2003, his 
definition was a broad one. It included indus- 
trial adoption of academic practices, such as 
publishing results in journals, and collabora- 
tions, within which a select few partners would 
share information with each other, but maintain 
secrecy with the wider world. Todd’s approach 
with open-source science was more radical. 


The female (green) and male parasites that cause 
the tropical disease schistosomiasis. 


“In drug discovery and development, you 
have a spectrum,” says Burrows. At one end are
pharmaceutical firms going about their work 
in secret, he says. At the other is open source. 
This model, whereby information is made freely 
available to a wider community and not just the 
people directly involved in the collaboration, 
originated in software development. Central 
to Todd's project was the decision to make the 
team’s laboratory notes available to the world 
while they worked. Although this meant that 
any other teams working towards the same 
objective were able to see exactly what they were 
doing, it also opened them up to external help. 
“The people contributing really knew what they 
were doing,” says Todd. Not only does he say 
that their input accelerated the research, but also 
that timely suggestions to change direction kept 
them away from expensive blind alleys. “We 
could have wasted the whole grant on something
unproductive,” he says.

Sharing lab notes can also prevent duplication 
of effort, says Katy Graef, associate director of 
BIO Ventures for Global Health in Seattle, Wash- 
ington. In one case, Graef says, the non-profit 
organization learned that researchers at Saint 
Louis University in Missouri were planning to 
screen inhibitors of the enzyme METAP-1 for 
antitubercular activity, a test that GlaxoSmithKline
had already performed. BIO Ventures put
the two parties in contact, and the Saint Louis
researchers were able to redirect their research, 
instead of wasting an estimated three months. 

Since his first foray into open-source 
research, Todd has developed a bigger project: 
Open Source Malaria. The aim of the project is 
to find new medicines for malaria. So far, the 
team has explored several groups of promising 
compounds, including those released through 
MMV, and it is currently exploring the potential 
of a set of molecules that originated with the 
drug giant Pfizer. 

Anyone can contribute to the project, and 
with all data and ideas posted publicly, this is 
perhaps the most open approach yet to tack- 
ling a tropical disease. The project is part of an 
emerging trend in drug research. The Open 
Source Pharma conference, sponsored by the 
Open Society Foundations and the Rockefeller 
Foundation, was held in 2014 and 2015. And 
since 2008, the Indian government-funded 
Open Source Drug Discovery project has 
been conducting crowdsourced tuberculosis 
research with the aim of expanding to other 
neglected diseases. The project has identified
potential molecules for fighting tuberculosis
and built infrastructure for coordinating
remote efforts on a tight budget.

The global burden of neglected tropical
diseases, as defined by the World Health 
Organization, as well as malaria and tuber- 
culosis is around 5.5% of the total number 
of healthy years lost to disability, poor health 
or early death. But research and development 
spending on neglected diseases in 2010 was 
only around 1% (about US$2.4 billion) of 
global-health research spending. And only 4% 
of therapies registered between 2000 and 2011 
were indicated for neglected diseases. The open 
research initiated in the past decade could begin 
to address these imbalances. 

Not all initiatives to tackle neglected dis- 
eases need to be run as publicly as Open Source 
Malaria to make an impact. The Wellcome Trust, 
for instance, began offering grants for multi- 
centre, transnational collaborations in 2015, 
inspired by the ad hoc partnerships that sprang 
up between labs during the Ebola crisis. “There 
are a lot of different models out there,” says 
Graef, “and it’s good that we have these different
approaches to keep things complementary.”

Burrows shares her optimism, and points 
to four promising compounds that emerged 
from MMV’s open work and are included in 
candidate drugs now in clinical testing. It will 
be a few more years before any open-origin 
treatment for neglected diseases makes it from 
the sharing stage all the way through to com- 
mercialization, but that is where everyone, from 
non-governmental organizations to the largest 
pharma firms, is placing their bets. “Everyone is 
aware that opening things up makes them more 
efficient,” Todd says.


Lucas Laursen is a freelance journalist based 
in Madrid.


Science is still too closed


Open initiatives are promising, but we have much further to go if research
data are to be as publicly accessible as they should be, says Aled Edwards.


To paraphrase Joy’s Law, no matter where you work, most of the
smartest people are somewhere else. This principle, coined by
co-founder of Sun Microsystems Bill Joy, could also apply to the
best data or the most cutting-edge technologies. Providing open access
to these distributed assets would accelerate science and innovation.
But, despite the promise that open access holds, it has so far proved
difficult to implement.

Few would dispute that sharing science accelerates the rate of discovery.
The fact that open science also boosts innovation in industry
is less well appreciated. Economist Heidi Williams at the Massachusetts
Institute of Technology in Cambridge retrospectively analysed
the commercial activity that flowed from two large sets of sequenced
human genes (H. L. Williams J. Polit. Econ. 121, 1-27; 2010). One
group was made up of sequenced genes that for a period of time had
been available only under commercially restrictive terms. The genes
in the other group had always been in the public
domain. Over a 10-year period, Williams found,
there was around 30% more commercialization
activity from the open set of genes.

A look at the biopharmaceutical sector reveals
a similar story. Both monoclonal antibody and
phage-display technologies are used to identify
precursors to antibody drugs. Both technologies
were invented more than 25 years ago, and both
have been used to discover successful medicines.
But whereas the phage-display technology has
been fiercely protected, monoclonal antibody
technology was placed in the public domain.
As of 2014, there were 47 approved monoclonal
antibody drugs and only 7 derived from phage-display
technology.

Although it may seem counter-intuitive, openness
is good for innovation. No wonder, then,
that the idea is gaining traction in biomedical research circles. ‘Open’
has become one of the hottest topics in boardrooms, funding agencies
and the media, and dozens of initiatives have been launched under
the open brand.

Open-access initiatives to make the scientific record more widely
accessible are being championed by charitable and governmental
organizations. Most of the largest funders now require that articles that
are derived from research that they support are made freely available
on the Internet after a period of time, often a year or less. These organizations
are also working to ensure the publication of large data sets
through initiatives that are modelled on the Human Genome Project
(HGP), which released data daily and without restriction on their use.

These are positive steps, but there remains much room for improvement.
Open-access publishing, for instance, is often possible only
when publishers charge hefty publication fees (which are paid for with
public funds) and some of the highest-impact journals continue to
resist open-access policies. Few of today’s open-data initiatives meet
the 20-year-old standards set by the HGP: most allow data release
to be delayed and let primary investigators control the data, and many
place various restrictions on data use.

Indeed, few initiatives are truly open. The term open inno- 
vation, as defined in the management literature in 2003 (H.W. 
Chesbrough Open Innovation: The New Imperative for Creating 
and Profiting from Technology; Harvard Business Press, 2003), 
refers to a collaboration in which two or more companies share 
or license proprietary information between themselves — a far 
cry from true openness. More recently, companies have begun 
to embrace less restrictive collaborative approaches to access 
more ideas and technologies, including crowdsourcing projects 
(see page S62) and pre-competitive consortia (see page S56). 
However, industry-led initiatives that yield publicly accessible research 
are still rare. 

The open-access movement has gained so much momentum that 
it can be tempting to believe that everything is awesome. The real- 
ity is more nuanced. Although the progress towards open access is 
encouraging, there is a long way to go before all 
scientific results are communicated in real time, 
at no cost and without restriction on use as a mat- 
ter of course. 

Considerable change is needed. Research 
produced in universities should be avail- 
able to all, but it is not. Universities often 
limit access to their research output because 
they continue to adhere to the ideology that 
secrecy and patents are obligatory founda- 
tions for commercialization and innovation. 
The imbroglio over who owns the rights to the 
CRISPR-Cas9 gene-editing technology will 
probably emerge as another case study for how 
the financial interests of institutions can inhibit 
innovation by limiting, rather than promoting, 
the uptake and application of foundational 
technologies. 

Experimental reagents and protocols should be freely available to 
allow researchers to reproduce experiments; this is not always the case, 
and even when it is, most are encumbered by legal agreements that 
restrict their use. 

Data from clinical and genetic studies should be made available to 
the study participants, but they are not. In most such studies, the data 
are considered to be proprietary, and there is no obligation to release 
them to the participants of the study, much less the public. 

If this is to change, I propose that society first agree on a simple, 
guiding principle: all scientific discoveries first constitute a public 
good and only second are the property of individual scientists, insti- 
tutions or countries. Agree on this, and it follows that anything that 
impedes the sharing of discoveries — either by prolonging the time 
or complicating the process of disseminating scientific outputs — 
should be eliminated entirely. We should not be satisfied with anything
less.


Aled Edwards is chief executive of the Structural Genomics 
Consortium at the University of Toronto in Canada. 
e-mail: aled.edwards@utoronto.ca


A researcher at the Montreal Neurological Institute and Hospital, which is making all its data public.


DATA SHARING 


Access all areas 


Advocates say that open science will be good for innovation. 
One neuroscience institute plans to put that to the test. 


BY BRIAN OWENS 


In the cut-throat world of early-stage clinical
development, where aggressive defence of
data and intellectual property is thought to
be key to amassing profits, one academic institute
is opting out.

Over the next five years, McGill University’s
Montreal Neurological Institute and Hospital
(the Neuro) in Canada will conduct a radical
experiment in open science. It will make all
results, data and publications from its research
free to access, will require collaborators to do the
same, and, perhaps most surprisingly, will not
pursue patents on any of its discoveries.

The primary motivation for the move is to
increase the pace of discovery in neuroscience,
a field in which clinical progress has so
far been slow. “We think that by sharing data
quickly, we'll be able to accelerate the discovery 
of mechanisms and eventually new medicines,”
says Neuro director Guy Rouleau. 

But Rouleau acknowledges that there is also a
moral argument for opening up scientific data. 
“We're funded mostly by public money, so it 
makes sense that it be freely available.”

The Neuro’s open policy, expected to come
into effect this summer, is based on five prin- 
ciples developed through a series of consulta- 
tions with the institute's faculty and staff. The 
first is that Neuro researchers will make all
information about a study publicly available
by the time the research is published.
This requirement will apply to all of the results,
positive and negative, as well as models,
algorithms, reagents and software.

Second, all data and resources generated 
through new research partnerships — whether 
they be with companies, institutes or other uni- 
versities — must follow the same rules. Third, 
the institute’s biobank, which contains tissue 
samples and brain-scan data, will be opened up 
(although the institute may charge users a small 
fee to cover operating costs). 

The fourth principle is that the institute will 
not pursue any intellectual-property protections 
for research discoveries. And the fifth is a com- 
mitment that, although the institute will not 
support activities that undermine these open- 
science principles, it will respect its research- 
ers’ autonomy. In practice, this means that a 
researcher could pursue a patent on their work, 
but the Neuro would not pay any of the fees or 
help with the paperwork. 

By sharing data and results early and often, 
scientists should get a better idea of what is 
going on elsewhere in their field, and avoid 
exploring blind alleys that others have already 
rejected. This is particularly important in 
neuroscience, says Rouleau, where progress is 
slowed by both the vast complexity of the brain 
and the heterogeneity of neurological diseases. 

Having a solid understanding of brain 
mechanisms is important for pharmaceutical 
companies working to develop new treatments, 
says Viviane Poupon, the institute's director of 
partnerships and strategic initiatives. “Without 
mechanisms, they’re just fishing randomly,” she
says. “We're trying to help diminish the 95% fail- 
ure rate of drug candidates targeting the central 
nervous system.” 

How much the principles of open science 
really can speed up innovation is a question 
that the Neuro hopes to help answer. Richard 
Gold, a researcher at McGill who studies the use 
of intellectual property in the life sciences and 
helped the Neuro to design its open principles, 
will be monitoring the institute's performance. 
He will then compare it with that of similar 
institutions that are not pursuing openness to 
see whether the Neuro’s decision leads to better
scientific outcomes. 

The specific metrics he will use are yet to be 
decided, but he suggests that he will be look- 
ing at how closely the institute follows its own 
principles, whether it inspires others to fol- 
low suit, whether people make use of its open 
resources, and how successful it is at attracting 
funding and staff. 

Open-science advocates hope that greater 
access to information will also help to solve a 
larger problem facing science than the slow 
translation of research into products: the fact 
that many high-profile results have proved 
impossible to replicate. “We're facing a credibil- 
ity problem. Not a month goes by without some 
field of science being rocked by scandal,” says
Björn Brembs, a neurobiologist and open-science
advocate at the University of Regensburg
in Germany. “Of the five or six large-scale replication
studies that I know of, none of them
confirm more than 50% of results.”

Guy Rouleau, director of the Neuro, favours open
science.

It’s no wonder, then, that the pipeline of 
new treatments has slowed to a trickle. “If the 
pre-clinical work is not replicable, then you 
can’t make a drug out of it,” says Brembs. Mike
Ehlers, the chief scientific officer of Pfizer’s 
Neuroscience and Pain Research Unit in 
Cambridge, Massachusetts, agrees. “The num- 
ber of key findings that we are able to robustly 
reproduce is not what I would want to see.” 

Christof Koch, president of the Allen 
Institute for Brain Science in Seattle, Wash- 
ington, thinks that having all the data from 
every experiment freely available will go a 
long way towards alleviating this problem. “If 
science wants to overcome this crisis, this lack 
of reproducibility, we have to practise what we
preach, and practise open science,” he says.


GROWING FAMILY 

The Neuro is joining a growing movement 
towards the free sharing of scientific data and 
results. Some large-scale projects, such as the 
international Human Genome Project, have 
freely shared all of their data, and many chari- 
table funders, such as the Bill & Melinda Gates 
Foundation, require the researchers they sup- 
port to make their data and published results 
freely available. 

The Allen Institute, launched in 2003 by 
Microsoft co-founder Paul Allen, has followed 
an open-science model “from the get-go”, says
Koch. The institute offers free access to its huge 
gene-expression maps for mice, humans and 
other animals, and posts data from its research 
online as soon as it is ready, rather than keep 
it hidden from researchers at other institutions 
until they are ready to publish a complete paper. 
“All our biggest papers have been published two
to three years after the data was put online,”
says Koch. “The idea that you need to hold on 
to your data until after you publish is not true.” 

Drug companies are also embracing the idea 
that there are advantages to sharing in the early 
stages of research. “In the past 10 years or so 
we’ve seen greater movement towards multi-party
collaborations,” says Gold. “Generally in
areas of science that are expensive, and considered
pre-competitive and high risk.”

The big pharmaceutical companies can see 
the value of working more openly with aca- 
demic neuroscientists and with each other to 
mitigate those expenses and risks. “It’s a very 
complex field,” says Hans Lindner, head of 
global external innovation and alliances at 
the pharmaceutical company Bayer in Berlin. 
“Combining efforts is essential to deal with 
complex matters.” 

Much of the work is done in large groups 
involving multiple companies, universities and 
public agencies, all combining their cash, exper- 
tise and equipment, and sharing the results. 
“They realized that they're all spending money 
doing the same thing, and they could leverage 
that money through partnerships to do more 
risky work,” says Gold. This provides universi- 
ties with a new source of funding and allows 
government agencies to steer work towards 
their priorities. It also benefits the companies, 
which see more efficient discovery of potential 
drug targets. 

Rouleau says that his inspiration for the 
Neuro’s open-science project came from one 
such large-scale collaboration, the Structural 
Genomics Consortium (SGC). The SGC is 
a collaboration between six universities and 
eight pharmaceutical companies from around 
the world. It receives funding from government 
agencies in Brazil and Canada, and the UK Well- 
come Trust, and generates protein structures, 
chemical probes and antibodies to speed up the 
development of new drugs. “When you can get 
academics to share, and get industry to share, 
you get a hell of a lot more and better data out 
of that collaboration,” says Aled Edwards, the 
SGC’s director. “It’s a fantastic way to do science.” 

Lindner says that open innovation is “an 
essential part of our R&D strategy”. In addi- 
tion to taking part in the SGC and the Euro- 
pean Union's Innovative Medicines Initiative, 
a public-private project to boost pharmaceuti- 
cal innovation, Bayer has several crowdsourc- 
ing initiatives that provide access to Bayer 
compounds, and an ‘incubator’ laboratory that 
provides start-ups with lab space and access to 
its facilities. “The benefit of the open source is 
it may ease up the early testing of concepts, and 
increase the chance of an interesting concept 
being further developed,” Lindner says. “Ulti- 
mately, we have more shots on goal.” 


NO PATENTS PLEASE 

Although open science is a growing trend, 
at least one aspect of the Neuro’s plan 
seems to be unique — the vow to eschew all 
intellectual-property protections on its work. 
It’s a move that is proving popular in the 
open-science community, where most feel 
that the drive to protect every discovery has 
gone too far and is stifling progress. “Patents 
don’t help drive innovation,” says Edwards. 
“They just get in the way most of the time.” 

Ehlers is even more blunt: “Universities 
tend to slather IP on every finding, regard- 
less of its potential value.” 

Lindner, meanwhile, is intrigued by the ‘no 
patents’ approach, and sees some potential 
benefits. “It may ease up the initial interac- 
tion between the institute and other parties,” 
he says, when legal niceties such as intel- 
lectual property rights and licensing fees 
are usually decided. 
“Otherwise you have […] at the beginning, 
which can be lengthy and frustrating.” 

However, Ehlers does foresee complications for 
companies that want to develop products that are 
based on the Neuro’s work. Companies often 
prefer to work with protected ideas, because it 
gives them a way to recoup their investment. 
“There has to be a well-calibrated use of 
patents,” Ehlers says. “If there’s too little 
protection, there’s no way to capture the value.” 

Rouleau says that the institute is not 
completely opposed to the idea of patents. 
If a Neuro discovery shows commercial 
promise, a pharmaceutical company will 
be welcome to take it in house and use it 
to develop a patentable medicine. Rouleau 
acknowledges that this means that the insti- 
tute and McGill would then lose out if one 
of the Neuro’s ideas became a blockbuster 
drug, but he thinks that risk is low. “We’re 
working at such early stages, anything we 
discover will need to be taken and worked 
on for years — our share of any profits would 
likely be small,” he says. 

And any loss should be more than made up 
for by new investments from philanthropic 
organizations and companies, from whom 
the Neuro’s open concept is already attracting 
serious interest. 

Gold agrees that passing up patents will 
be no great loss for the Neuro. “Institutions 
waste a lot of time patenting, and most 
patents don’t generate any money,” he says. “This 
gets the university out of the business of 
business, and back into knowledge generation.” 

Besides, the institute’s top priority should 
be to do its best for the people relying on it to 
help cure their illnesses, not to make money, 
says Rouleau. “Blocking other people from 
working with our findings is not in the best 
interest of patients,” he says. 


Brian Owens is a freelance science writer 
based in St. Stephen, New Brunswick. 
COMMENT 


Industry–academia collaborations for biomarkers 


Lilla Landeck 


Several types of collaboration are being pursued to identify, validate and apply new 
biomarkers. Here, we highlight examples of such initiatives and discuss the challenges, 
approaches to address these challenges and key factors for success. 


Biomarkers are becoming increasingly important for 
predicting disease prognosis, enabling personalized 
therapy (also known as precision medicine) and detecting 
early therapeutic and adverse responses to drugs. 
However, the identification, validation and application of 
biomarkers is challenging in several respects, including 
understanding of the biology of the biomarker and its 
relevance to disease, the technological characteristics 
of the assay used for biomarker measurement and the 
potential regulatory requirements. The challenges are 
particularly demanding when co-developing a drug and 
a companion diagnostic based on a biomarker. In many 
cases, the breadth and complexity of research needed 
mean that collaborations are vital. This article highlights 
examples of such initiatives and discusses key factors for 
success. 


Public–private partnerships 

Substantial public funding has become available for 
biomarker research in recent years. Public–private 
partnerships (PPPs) such as the European Innovative Medicines 
Initiative (IMI), the US Biomarkers Consortium and 
others (see Supplementary information S1 (table)) offer 
the opportunity to create a critical mass of partners in a 
specific area. Usually, several companies and academic 
institutions join forces to address an issue that would be 
hard or even impossible to tackle alone. 

For example, in 2006, the US Food and Drug 
Administration (FDA), the US National Institutes of 
Health (NIH) and the Pharmaceutical Research and 
Manufacturers of America (PhRMA) founded the 
Biomarkers Consortium¹. To date, the consortium has 
launched 18 projects in four disease areas (cancer, 
inflammation and immunology, metabolic disorders, 
and neuroscience), and nearly half of these projects 
have been completed. The consortium has made notable 
contributions to accelerate the drug development 
process and to enhance regulatory decision-making. For 
example, the establishment of a set of evidence-based 
recommendations on new interim end points supported 
the FDA’s efforts to approve new agents for 
community-acquired bacterial pneumonia and acute skin infections¹. 
Another important project, the I-SPY 2 trial of potential 
drugs to treat breast cancer, has illustrated how innova- 
tive adaptive trial designs may substantially reduce the 
time and cost of getting new therapies on the market¹. 
The Predictive Safety Testing Consortium, which origi- 
nated from the FDA’s Critical Path Initiative and has 
been collaborating for several years with the Biomarkers 
Consortium on a kidney safety project, has qualified 
seven biomarkers of nephrotoxicity for use in preclini- 
cal toxicology studies and continues to work towards 
additional safety-biomarker qualifications². Work by 
another consortium originating from the Critical Path 
Initiative, the Coalition Against Major Diseases, has led 
to the qualification of low baseline hippocampal volume 
as assessed by magnetic resonance imaging as an enrich- 
ment biomarker for clinical trials in pre-dementia stages 
of Alzheimer disease². 

Biomarker-related programmes are also being pur- 
sued through the IMI. For example, the SAFE-T project 
has evaluated more than 150 biomarkers for drug- 
induced liver, kidney and cardiovascular toxicity on 
the basis of protocols defined upfront together with the 
European Medicines Agency (EMA) and the FDA. The 
20 most promising biomarkers are just entering clinical 
validation. Another interesting example is the recently 
constituted IMI consortium CANCER-ID, which aims to 
evaluate the clinical utility of different technologies for the 
enrichment, isolation and analysis of circulating tumour 
cells (CTCs), circulating tumour DNA and microRNAs. 
These biomarkers could support the concept of a ‘liquid 
biopsy’, which may allow for longitudinal sampling when 
standard biopsies are not available or pose a considerable 
risk to the patient. The 36 CANCER-ID partners from 
academia, hospitals, small-to-medium-sized enterprises, 
non-profit organizations, and diagnostics and pharma- 
ceutical companies are now working on defining standards 
for CTC identification and counting as well as defining 
the standard operating procedures for pre-analytical 
sample handling and analyses. 

PPPs also facilitate the testing of novel approaches, 
such as systems medicine for the identification of bio- 
marker patterns that could improve patient stratifica- 
tion. One example is the IMI project OncoTrack, which 
is using novel computational modelling approaches to 
analyse deep sequencing data from >200 patients with 
colon cancer. 

Another European consortium, known as BIO-DrIM 
(Biomarker-Driven IMmunosuppression), will apply 
recently identified biomarkers to help manage immuno- 
suppression after solid organ transplantation. In several 
clinical trials that are supported by diagnostics and phar- 
maceutical companies, biomarkers will be used for the 
first time as decision criteria to guide personalized immu- 
nosuppression, which could allow the minimization of or 
even weaning from drugs without harming the kidney 
and liver allografts. The methodological implementation 
of these biomarkers is finalized, and up to 1,000 patients 
will be enrolled in trials. 

Crucial factors for the success of such complex con- 
sortia include: finding a good balance between critical 
mass and manageable size; involvement of both diagnos- 
tics and pharmaceutical industry partners; early align- 
ment on the goals between the public and the private 
partners; agreement on a suitable intellectual property 
framework before the project starts; and professional 
project management. 


Open innovation 

The use of open-innovation models, also known as 
‘crowd sourcing’, for biomarker discovery and validation 
is still in its infancy³. A pioneering example is provided 
by InnoCentive, which was established in 2001 to match 
problems in a wide variety of fields with potential solu- 
tion providers, using an online platform on which chal- 
lenges are posted by organizations. For example, a chal- 
lenge with a US$1-million prize was set up in 2006 to 
find a biomarker for amyotrophic lateral sclerosis (ALS) 
by Prize4Life, a non-profit organization established to 
accelerate the discovery of ALS therapies. This challenge 
attracted ~1,000 teams from ~20 countries, and the prize 
was ultimately awarded in 2011 for the development of 
electrical impedance myography to assess the progres- 
sion of the disease. The results of another crowd-sourcing 
challenge set up by Prize4Life and the DREAM Project, 
which involved the use of clinical trial data from ALS 
patients to develop algorithms for the prediction of disease 
progression, have recently been published⁴. 

Another example is Bayer’s Grants4Targets (G4T) 
initiative⁵, which was established in 2009 as a 
mechanism for academic institutions to apply for grant 
support to pursue ideas on novel drug targets, and which 
was expanded to include biomarkers in 2011. In addition 
to financial support, Bayer provides specific knowledge 
about drug discovery and development. More than 1,000 
applications have been received so far; 126 (~16%) have 
been for biomarkers, of which 18 have been approved for 
grants. These are focused on biomarker identification in 
cardiological, oncological and gynaecological indications, 
often using novel technologies. As the G4T programme is 
quite young, it is too early to judge its ultimate success in 
the identification and validation of biomarkers. However, 
on the basis of our experience so far, this type of collabora- 
tion attracts academic groups and offers the chance to gain 
access to new partners. Key prerequisites for success are: 
fast and efficient processing of the requests; a low bureau- 
cratic burden to generate and grant the proposals; and 
intensive and direct personal contact with the scientists in 
the academic institutes after grant approval. 


Concluding remarks 

Collaboration between academia, the diagnostics indus- 
try and the pharmaceutical industry is particularly 
important for successful biomarker identification, vali- 
dation and application, and requires substantial resources 
and complementary skills. We expect the high number of 
PPPs in this area will continue to increase, in part owing 
to the growing availability of public funding. 

Different models may be best suited for different 
stages of the biomarker identification and validation pro- 
cess and for different kinds of biomarkers. For the initial 
exploration of a new biomarker candidate or a novel tech- 
nology, an open innovation approach may be particularly 
suitable, as a lot of creativity, flexibility, and freedom is 
needed at this early stage. For the subsequent clinical 
validation of a biomarker to the standards required for 
acceptance by regulatory authorities, larger consortia 
may be necessary given the extent and complexity of the 
work involved. 

Each category of collaboration has its own needs to 
be successful. However, there are some general factors. 
Organizations and people will only actively engage in col- 
laboration when the benefit they derive is greater than 
the effort and time it takes to collaborate. Organizations 
also need to foster their competencies in relationship- and 
conflict-management and skills in working across the 
diverse cultures in academia and industry. Finally, real 
internal commitment from the organizations involved is 
essential for any collaborative effort to be successful. 


1. Wholley, D. The Biomarkers Consortium. Nat. Rev. Drug Discov. 
13, 791–792 (2014). 

2. Stephenson, D. & Sauer, J. M. The Predictive Safety Testing 
Consortium and the Coalition Against Major Diseases. 
Nat. Rev. Drug Discov. 13, 793–794 (2014). 

3. Lessl, M. et al. Crowd sourcing in drug discovery. 
Nat. Rev. Drug Discov. 10, 241–242 (2011). 

4. Küffner, R. et al. Crowdsourced analysis of clinical trial data to 
predict amyotrophic lateral sclerosis progression. Nat. Biotech. 
33, 51–57 (2015). 

5. Dorsch, H. et al. Grants4Targets: an open innovation initiative to 
foster drug discovery collaborations between academia and the 
pharmaceutical industry. Nat. Rev. Drug Discov. 14, 74–76 (2015). 


Acknowledgements 

The authors would like to thank T. Schlange, M. Lessl, H. Dorsch (all at 
Bayer, Germany), and H.-D. Volk (at the Berlin-Brandenburg Center for 
Regenerative Therapies, Charité, Berlin, Germany) for helpful discussions, 
and P. Carrigan for critical reading of the manuscript. 


Competing interests statement 
The authors declare competing interests: see Web version for details. 


SUPPLEMENTARY INFORMATION 


See online article: S1 (table) 


806 | DECEMBER 2015 | VOLUME 14 


www.nature.com/reviews/drugdisc 


© 2015 Macmillan Publishers Limited. All rights reserved 


PERSPECTIVES 


OPINION 


Hit and lead criteria in drug 
discovery for infectious diseases 
of the developing world 


Kei Katsuno, Jeremy N. Burrows, Ken Duncan, Rob Hooft van Huijsduijnen, 
Takushi Kaneko, Kiyoshi Kita, Charles E. Mowbray, Dennis Schmatz, 
Peter Warner and B. T. Slingsby 


Abstract | Reducing the burden of infectious diseases that affect people in the 
developing world requires sustained collaborative drug discovery efforts. 
The quality of the chemical starting points for such projects is a key factor in 
improving the likelihood of clinical success, and so it is important to set clear 
go/no-go criteria for the progression of hit and lead compounds. With this in 
mind, the Japanese Global Health Innovative Technology (GHIT) Fund convened 
with experts from the Medicines for Malaria Venture, the Drugs for Neglected 
Diseases initiative and the TB Alliance, together with representatives from the 
Bill & Melinda Gates Foundation, to set disease-specific criteria for hits and leads 
for malaria, tuberculosis, visceral leishmaniasis and Chagas disease. Here, we 
present the agreed criteria and discuss the underlying rationale. 


There is an urgent need for new and 
more-effective drugs to treat the various 
diseases that take the heaviest toll on the 
developing world’. This can only be achieved 
in a cost-effective manner by implementing 
robust and efficient processes to develop 
and deliver drugs that are safe, effective, 
affordable and available to those who need 
them most. 

The quality of chemical starting points 
(known as ‘hits’) for drug discovery projects 
is a key factor for improving the likelihood 
of success of clinical candidates; starting a 
discovery project with poor-quality hits ulti- 
mately results in increased attrition of these 
compounds”*. The decisions to progress a hit 
into ‘lead’ identification (the HTL phase) and 
then on into lead optimization are crucial, 
as the downstream optimization phase may 
take years and require considerable financial 
investment. Setting the bar high by applying 
comprehensive, well-considered criteria for 
entry into lead optimization will improve 
overall success rates and focus resources 
on chemical series that stand a reasonable 
chance of delivering a quality preclinical 
candidate’. The need to focus resources 
effectively is particularly important in 
drug research and development (R&D) 
for infectious diseases that affect people in 
the developing world, given the limited 
market incentives for R&D investment and 
the key role that philanthropic funding and 
public-private partnerships have in this field. 
The Japanese Global Health Innovative 
Technology (GHIT) Fund (BOX 1) is a pioneer- 
ing public-private partnership. It provides 
resources for research organizations in Japan 
to partner with international research groups 
and product development partnerships 
(PDPs) to screen and identify new hits and 
develop them into novel lead series for infec- 
tious diseases that affect people in the devel- 
oping world. The initial areas of focus for the 
GHIT Fund are malaria (in partnership with 
the Medicines for Malaria Venture (MMV)), 
tuberculosis (TB; in partnership with the 
TB Alliance), Chagas disease and visceral 
leishmaniasis (in partnership with the Drugs 
for Neglected Diseases initiative (DNDi)). 


Specific criteria have been proposed for 
defining hits and leads in the development 
of drugs for diseases such as malaria*®; 
however, it is essential that such criteria are 
regularly reviewed and updated as part of 
evolving target product profiles (TPPs) to 
reflect accumulated experience from drug 
discovery projects and emerging scientific 
research, clinical experience, policy guidance 
and patient need. In order to reach a consen- 
sus on the HTL criteria to be used to guide 
its collaborative activities, the GHIT Fund 
recently convened an initial gathering of its 
key international partners involved in drug 
discovery — the MMV, the TB Alliance and 
the DNDi — together with representatives 
from the Bill & Melinda Gates Foundation. 
The objective of this meeting was to take the 
first steps towards creating a shared, flexible 
strategy to expedite the discovery of the next 
generation of drugs for these diseases by 
GHIT Fund collaborations. Here, after briefly 
discussing some of the general characteristics 
of the relevant screening assays and estab- 
lished target product profiles, we present the 
proposed generic and disease-specific criteria 
for hits and leads. We hope that these guide- 
lines may also be valuable to drug discovery 
efforts outside these partnerships. 


Hit discovery and assay development 
For infectious diseases, hit candidates 
usually come from screens that involve intact 
pathogens, akin to phenotypic screens (or, 
more generally, high-content screens) rather 
than target-based screens. Which of these 
approaches is more productive is still being 
debated”*. Despite being fuelled by advances 
in genomics, target-based high-throughput 
screening, along with computer-assisted 
modelling, has not been as productive as 
phenotypic screening in the antibacterial area’. 
However, for subsequent optimization efforts 
there is no doubt that knowing the molecular 
target of a hit series is a major advantage. For 
instance, such knowledge allowed the clinical 
antimalarial candidate DSM265 to be specifi- 
cally optimized to inhibit Plasmodium falci- 
parum dihydroorotate dehydrogenase rather 
than its human orthologue’’. Additionally, 
knowing the molecular target enables 
target-specific liabilities to be identified earlier 
in the R&D process. Furthermore, from a 
portfolio point of view, it ensures that a mix 
of mechanistic approaches is followed, 
which is especially relevant in diseases in 
which drug resistance develops and spreads 
quickly (for example, TB and malaria). 


Box 1 | The Global Health Innovative Technology Fund: unlocking Japanese innovation 


The Global Health Innovative Technology (GHIT) Fund of Japan was founded in April 2013 with a 
vision to alleviate the burden of infectious disease that prevents billions of people from seeking the 
level of prosperity and longevity commonly expected in industrialized nations. Uniquely, the GHIT 
Fund aims to do this by facilitating international partnerships that enable Japanese technology, 
innovations and insights to have a more direct role in making this vision a reality. Along with six 
Japanese pharmaceutical companies and one diagnostic company, the GHIT Fund is supported by 
two Japanese government ministries and numerous unexpected co-sponsors, including Mori 
Building Co., Ltd., All Nippon Airways Co., Ltd., Morrison & Foerster LLP, and Yahoo! Japan (see the 
figure, part a). The Japanese pharmaceutical industry is represented by the 69 members of the 
Japanese Pharmaceutical Manufacturers Association. 

A key aim of the GHIT Fund is to link existing product development partnerships (PDPs), which are 
responsible for discovering and delivering new medicines for malaria, tuberculosis (TB) and 
neglected tropical diseases (NTDs), with Japanese partners that have expertise in drug research 
and development (R&D). This is achieved by establishing a portfolio of screening projects and a 
hit-to-lead (HTL) platform that use defined criteria to ensure that the most attractive chemical series 
are selected for further optimization and development. Research collaborations must comprise at 
least two distinct organizations, one of which is Japanese, in order to be eligible. For the screening 
and HTL platforms the compounds must originate from a Japanese entity. In addition, the HTL 
platform aims to maximize the skills and resources of existing global PDPs that operate in the field of 
NTDs and other infectious diseases that affect people in the developing world, such that each new 
partner must enter into a research collaboration with one of three specific PDPs (the Medicines for 
Malaria Venture (MMV), the Drugs for Neglected Diseases initiative (DNDi) or the TB Alliance). 

Beyond the benefits of sharing expertise, collaborations of this type will allow diverse chemical 
libraries to be probed using various screening approaches. Importantly, Japanese compound assets 
of synthetic and natural origin could be a rich source of novel and chemically diverse compounds 
that have not previously been screened for infectious diseases of the developing world. Past 
examples of key natural-product-based drugs that originated from Japanese discovery efforts 
include ivermectin, an antiparasitic drug from the actinomycete Streptomyces avermectinius; 
the multiple sclerosis therapy fingolimod, which is a metabolite of the insect fungus Isaria sinclairii; 
and the founder of the statin drug class, mevastatin, which was discovered in Penicillium spp. 
extrolites by Akira Endo of Sankyo in the 1970s. 

In just over 2 years since its inception, the GHIT Fund has facilitated more than 30 partnerships 
(for screening collaborations see the figure, part b) and invested more than US$40 million in them. 
All proposals received are evaluated by external reviewers and a selection committee, both of which 
are completely independent from the GHIT Fund. 

In addition to the existing three platforms (the screening, HTL and product development platforms), 
the GHIT Fund also initiated the ‘Target Research Platform in Partnership with Grand Challenges’ 
(TRP) for early-stage development of radically new and improved drugs, vaccines and diagnostics 
to prevent and treat infectious diseases that are prevalent in developing countries. With additional 
Japanese partnerships expected and additional compound screening and HTL proposals being 
submitted, the GHIT Fund encourages research and shows a commitment to the cause that is 
unprecedented in this area of research within Asia. 

Fortunately, advances in genomics and 
related technologies usually enable the molec- 
ular target of a drug (or at least its mode of 
action) to be elucidated even when the com- 
pound was discovered in a phenotypic screen; 
such screens may therefore be viewed as 
discovery engines for druggable targets, along 
with the drug molecules themselves. The two 
crucial success components for screening 
are the availability of chemical libraries with 
‘interesting’ chemistry and taking great care 
to develop assays that faithfully reproduce the 
microenvironment of a pathogen with a dis- 
ease-relevant readout, while also maintaining 
throughput and robustness. Such advanced 
assays also eliminate compounds that do not 
penetrate cellular membranes or are otherwise 
unavailable to the target. 

The organisms that cause malaria, 
TB and NTDs are biologically quite diverse, 
including viruses, bacteria and eukaryotes. 
Each of these comes with its own challenges 
and opportunities, resulting in differences 
in hit and lead criteria and assay conditions. 
Dogma holds that bacteria replicate quickly 
(~20-minute generation time) and eukaryotic 
(mammalian) cells double in number in 
~24 hours, but these guidelines do not apply 
to the pathogens that cause the four diseases 
discussed in this article. Mycobacterium 
tuberculosis is an exception among bacteria 
in that it has up to 16-hour doubling times, 
whereas the doubling times of the eukaryotic 
Leishmania spp. parasites, which cause 
visceral leishmaniasis, are ~6—9 hours. 
The in vitro culture of all major strains of 
P. falciparum — the species responsible for 
the majority of deaths from malaria — has 
been an important breakthrough", but its 
doubling time is ~48 hours’. Owing to these 
differences, biomass generation and assay 
set-up vary in difficulty between organisms. 
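
To make the practical consequence of these doubling times concrete, the following minimal sketch compares how much biomass each organism would generate over a fixed culture window, assuming idealized exponential growth, N(t) = N0 × 2^(t/Td). The 72-hour window, the function name and the choice of a midpoint for Leishmania spp. (7.5 hours) and the upper bound for M. tuberculosis (16 hours) are illustrative assumptions, not values taken from any assay protocol.

```python
# Illustrative only: idealized exponential growth with no lag phase or plateau.
# Doubling times (in hours) are the approximate figures quoted in the text.
DOUBLING_TIME_H = {
    "typical bacterium": 20 / 60,  # ~20-minute generation time
    "mammalian cell": 24.0,
    "M. tuberculosis": 16.0,       # "up to" 16 h; upper bound used (assumption)
    "Leishmania spp.": 7.5,        # midpoint of ~6-9 h (assumption)
    "P. falciparum": 48.0,
}


def fold_expansion(doubling_time_h: float, window_h: float) -> float:
    """Return N(t)/N0 = 2 ** (t / Td) for an idealized culture."""
    if doubling_time_h <= 0:
        raise ValueError("doubling time must be positive")
    return 2.0 ** (window_h / doubling_time_h)


if __name__ == "__main__":
    window_h = 72.0  # hypothetical culture window
    for organism, td in DOUBLING_TIME_H.items():
        print(f"{organism:18s} {fold_expansion(td, window_h):.3g}-fold")
```

Under these assumptions, a 72-hour window yields an 8-fold expansion for a mammalian cell line but less than a 3-fold expansion for P. falciparum (2^1.5), which is one way to see why biomass generation differs so much between organisms.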

The potency cut-offs for compounds to 
progress to the next stage of development 
vary by disease (see below) and are decided 
by a number of factors. Among these 
factors are empirical hit rates (the higher the 
hit rate, the lower the cut-off concentration). 
Plasmodium spp. hit rates are generally good, 
which is possibly related to the unusual (and 
vital) apicoplast organelles that these organ- 
isms harbour. Moreover, these organisms 
must actively overcome the toxicity of haem 
degradation and remodel their host cell (red 
blood cell) membrane transport capabilities, 
and these processes are druggable. By contrast, 
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it is perhaps not surprising that targeting 
and killing the intracellular amastigote form 
of Leishmania spp. parasites is more chal- 
lenging, as amastigotes reside inside acidic 
phagolysosomes within macrophages, which 
present additional membranes and pH gra- 
dients that drugs must cross before reaching 
their target. Trypanosoma cruzi, the parasite 
that causes Chagas disease, presents its own 
challenges owing to its ability to infect many 
cell types, producing a highly dynamic infec- 
tion. Furthermore, drugs that only target 

the replicating stages of the parasite may 
leave non-replicating forms, such as trypo- 
mastigotes, capable of maintaining infections 
long after the end of the treatment”. 


Established target product profiles 

It is important that hit and lead series are 
assessed as early as possible for conformity 
with the relevant disease TPPs and, if appro- 
priate, target candidate profiles (TCPs). 
However, in most cases, further detailed 
biological or pharmacokinetic studies will 
be required to fully judge how a series can be 
strategically positioned. These TPPs and 
TCPs are typically developed by the PDPs 
in discussion with the research and medical 
communities. 


TPPs and TCPs for malaria. In the case of 
malaria, two TPPs and five TCPs have been 
established, which reflect different patient 
populations and medicinal uses*™. In brief, 
malaria TPP1 is focused on treatment, ideally 
from a single dose, such that all symptomatic 
and asymptomatic parasites (gametocytes) in 
a host are cleared and transmission is blocked 
in addition to the patient being cured. TPP2 
is focused on prophylaxis, because when 
attempting to eliminate a disease it is recog- 
nized that people in once-endemic regions, 
now with potentially reduced immunity, may 
require protection in the event of transmis- 
sion outbursts. The TPPs focus on the profile 
of a medicine that is composed of two or 
more active ingredients (a criterion that is 
presently mandatory for artemisinins”»). 

By contrast, the TCPs focus on attributes 
that individual molecules need to possess 
(acknowledging that one molecule may 
possess more than one attribute). The key 
features of the five TCPs are: fast parasite 
clearance (TCP1); a combination partner, 
ideally long duration, which provides post- 
treatment prophylaxis (TCP2); prevention 
of Plasmodium vivax and Plasmodium ovale 
relapse (TCP3a); prevention of transmission 
(TCP3b); and chemoprotection (TCP4)°. 
TCP1, TCP2, TCP3a and TCP3b all support 
TPP1, whereas TCP4 supports TPP2 (REF. 6). 




Box 2 | Generic hit selection criteria for infectious diseases 

The panel of drug experts convened by the Japanese Global Health Innovative Technology (GHIT) 
Fund identified the following generic hit selection criteria: 

• A hit should have a potency that is consistent with the potential to deliver a lead compound 
(see BOXES 4-6 for details of criteria defined specifically for each disease) 

• The chemical structure of a hit should be confirmed by identification (for natural products), 
re-synthesis or re-purification 

• Primary screening data should be validated on a selection of hit compounds (>90% pure) 

• A hit should have an acceptable in vitro response (typically, a sigmoidal concentration-growth 
inhibition curve reaching a maximal 100% efficacy, with a Hill coefficient ideally between 0.5 
and 1.8) 

• Preliminary knowledge of the structure-activity relationship (SAR; often available from 
analogues in the original screening library) of a hit is desirable 

• A hit should have a tractable chemotype: it should have no highly reactive or unstable moieties 
in the pharmacophore and be amenable to structural variation by chemical (or biochemical) 
synthesis. Hits should pass basic drug-like filters, such as pan-assay interference filters (PAINS), 
to eliminate promiscuous hits that lack target specificity. Conformity to the ‘rule of five’ 
(REF. 58) is preferred 

• There should be a greater than 10-fold selectivity window for cytotoxicity using a mammalian 
cell line (for example, HepG2 or Vero cells) 

• A hit requires adequate selectivity in a biochemical counter-assay (for example, a homologous 
mammalian target) where relevant. However, most infectious disease hit-to-lead projects are 
based on phenotypic rather than target-based screens 

• No serious intellectual property conflicts should exist (that is, a ‘freedom to operate’ is needed). 
However, with the value of US Food and Drug Administration (FDA) priority review vouchers now 
entering the pharmacoeconomic equation, there are further possibilities to develop drugs for 
infectious diseases even in the absence of intellectual property protection 

• No major synthesis or formulation issues should be anticipated (compounds should ideally 
be synthesized in <5 steps with an acceptable yield and acceptable solubility). For reasons of 
affordability, this criterion is more stringent than for drug discovery in general 
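
The numeric thresholds in BOX 2 can be read as a simple checklist. The following Python sketch is only an illustration of that reading: the record type `HitData`, its field names and the function `generic_hit_flags` are hypothetical, and the qualitative criteria (structure confirmation, SAR knowledge, PAINS filtering, freedom to operate) are deliberately left out because they cannot be reduced to a single number. 

```python
from dataclasses import dataclass


@dataclass
class HitData:
    """Hypothetical record of screening data for one hit compound."""
    ic50_um: float            # potency against the pathogen (µM)
    cc50_um: float            # cytotoxicity against a mammalian cell line (µM)
    hill_coefficient: float   # slope of the fitted concentration-response curve
    purity_pct: float         # purity of the re-tested sample (%)
    synthesis_steps: int      # number of synthetic steps


def generic_hit_flags(hit: HitData) -> dict[str, bool]:
    """Return one pass/fail flag per numeric criterion listed in BOX 2."""
    return {
        # Primary data validated on compounds that are >90% pure
        "purity_over_90pct": hit.purity_pct > 90.0,
        # Hill coefficient ideally between 0.5 and 1.8
        "hill_between_0.5_and_1.8": 0.5 <= hit.hill_coefficient <= 1.8,
        # Greater than 10-fold window between cytotoxicity and potency
        "selectivity_over_10_fold": hit.cc50_um / hit.ic50_um > 10.0,
        # Ideally synthesized in fewer than 5 steps
        "fewer_than_5_steps": hit.synthesis_steps < 5,
    }
```

Returning a flag per criterion, rather than a single verdict, matches the spirit of the text: a failed flag marks a liability to understand, not an automatic rejection. 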


TPPs for TB. TPPs for TB usually require 
new drugs to shorten the duration of treatment, 
demonstrate efficacy against drug-sensitive 
and drug-resistant M. tuberculosis 
strains and show potential for use in drug 
combinations in developing countries 
(for example, those with oral and once-daily 
dosing, and low cost of goods). These goals 
are based on the fact that the majority of TB 
drugs and treatments have not changed in 
the past half-century, even though two new 
drugs (bedaquiline and delamanid) were 
recently approved, the first new drugs for 
TB in the past 40 years. Additional drugs are 
needed to develop new treatment regimens. 


TPPs for Chagas disease and leishmaniasis. 
The DNDi has defined a TPP for Chagas 
disease with ‘acceptable’ and ‘ideal’ criteria (see 
the DNDi website). The benchmark for clini- 
cal efficacy is benznidazole, and this should 
therefore be the comparator for all hit and lead 
candidates. Similarly, the DNDi has published 
(with consultancy) ‘optimal’ and ‘minimal’ 
TPPs for visceral and cutaneous leishmaniasis. 
For visceral leishmaniasis, a safe oral drug 
with >90% efficacy within 10 days is crucial. 
Relevant for early compound triage is that a 
drug must be active against all resistant strains. 


Generic criteria for hits and leads 

Hit series definition and criteria. A number 
of generic hit criteria, identified by a panel 
of drug experts convened by the GHIT 
Fund, apply to all four infectious diseases, 
and these are listed in BOX 2. In general, 
the objectives at this stage are to build 
confidence in the quality of a compound 
series and the associated data, understand 
the liabilities and, if possible, generate data 
that guide medicinal chemists during the 
HTL phase. Two main types of criteria 
are covered: disease-specific criteria, focusing 
on potency, efficacy and pathogenicity; 
and compound-specific criteria, which 
focus principally on the chemical scope 
of the compound and a risk assessment of 
drug metabolism and pharmacokinetics 
(DMPK), as well as the physical properties 
that are predictive of an oral therapy. 


Lead series definition and criteria. During 
the HTL phase, the project team focuses on 
the optimization of a chemical series so as 
to improve any compound properties that 
could be an obstacle to further progression. 
Depending on the profile, the HTL strategy 
and medicinal chemistry plan can be very 
different between series. The milestone for 
this phase is the delivery of a lead series. 
The generic criteria for such a lead are 
shown in BOX 3 (REF. 19). 


Box 3 | Generic lead selection criteria for infectious diseases 

The panel of drug experts convened by the Japanese Global Health Innovative Technology (GHIT) 
Fund identified the following generic lead selection criteria: 

• A lead requires an acceptable in vitro potency for the relevant disease (see BOXES 4-6 for 
disease-specific details). In general, high potency (a low half-maximal inhibitory concentration 
(IC50)) is highly desirable but not at the expense of physicochemical properties or drug 
metabolism and pharmacokinetic (DMPK) characteristics 

• A lead should have oral efficacy in the appropriate disease model. Oral efficacy removes one key 
risk early on, namely, uncertainty about in vivo validation of the mechanism of action of the series 
(which is often unknown at this stage) 

• The synthetic chemistry should be amenable to series expansion, as many more compounds are 
likely to be needed for testing in as short a time frame as possible 

• A lead needs a greater than 10-fold selectivity in killing pathogens as opposed to mammalian cells 
in tuberculosis (TB) and a greater than 100-fold selectivity for the pathogens that cause the other 
diseases (malaria, Chagas disease and visceral leishmaniasis). This reflects the increased challenge 
of finding potent, attractive compounds that target TB 

• A lead should have acceptable physicochemical properties: typically, solubility in 
phosphate-buffered saline >10 µM (a sufficient level of solubility is expected to avoid problematic 
formulations) and acceptable lipophilicity (LogP values are typically <5 and ideally <3) 

• A lead requires manageable drug metabolism and pharmacokinetic profiles. This involves liver 
microsomal and hepatocyte stability across species; understanding of the plasma protein, 
microsomal and media binding; good membrane permeability; and no unmanageable cytochrome 
P450 inhibition 

• A lead needs to demonstrate good oral bioavailability in rodents (demonstrated F >25%); 
ultimately, high oral bioavailability is highly desirable as it will reduce the potential for inter-patient 
variability, pill size, dose and cost of the medicine. It is therefore crucial that this parameter is 
tractable at the lead stage 

• A lead needs an acceptable early safety assessment based on target (orthologue) and compound 
liabilities, in vivo observations, in vitro studies (for example, genotoxicity and the mini-Ames test), 
cytotoxicity, cardiac safety (as assessed using the hERG channel (QT prolongation)) and in silico 
approaches. A secondary pharmacology selectivity profile, consistent with achieving selectivity 
with the candidate compound, is also required. This would include human orthologues and 
paralogues of the targeted enzyme or receptor, if known; the number of these that are to be tested 
depends on the gene family size and their known or suspected safety risks (when targeted) 

• All liabilities of the series should be understood (as a result of extensive profiling) and a rationale 
and medicinal chemistry plan generated for how they might be overcome in the subsequent 
optimization phase 

• A lead should contain no known toxicophores or undesirable reactive groups and no detrimental 
chemical feature or characteristic associated with the pharmacophore indicative of, for example, 
adduct formation. This avoids the scenario in which a region of the molecule responsible for 
activity has a liability that cannot be overcome 

• A lead should display no acute toxicity in in vivo efficacy studies. Although no formal in vivo safety 
studies are performed at this stage, careful observation of efficacy studies, particularly at high, 
repeated doses, can be informative 

• Preferably, there should be no apparent intellectual property obstacles to the progression 
of any series (freedom to operate) 
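
As with the hit criteria, the quantitative parts of BOX 3 lend themselves to a small lookup-and-compare routine. The sketch below is an assumption-laden illustration, not a validated tool: the dictionary `MIN_SELECTIVITY_FOLD`, the disease keys and the function `lead_flags` are hypothetical names, and only the thresholds stated in BOX 3 (selectivity, solubility, LogP and oral bioavailability) are encoded. 

```python
# Selectivity thresholds taken from BOX 3: >10-fold for TB,
# >100-fold for malaria, Chagas disease and visceral leishmaniasis.
# The dictionary keys are hypothetical labels chosen for this sketch.
MIN_SELECTIVITY_FOLD = {
    "tb": 10.0,
    "malaria": 100.0,
    "chagas": 100.0,
    "visceral_leishmaniasis": 100.0,
}


def lead_flags(disease: str, selectivity_fold: float, solubility_um: float,
               log_p: float, oral_f_pct: float) -> dict[str, bool]:
    """Return one pass/fail flag per numeric lead criterion in BOX 3."""
    return {
        "selectivity": selectivity_fold > MIN_SELECTIVITY_FOLD[disease],
        # Solubility in phosphate-buffered saline >10 µM
        "solubility_over_10_um": solubility_um > 10.0,
        # LogP typically <5 and ideally <3
        "logp_under_5": log_p < 5.0,
        "logp_ideally_under_3": log_p < 3.0,
        # Oral bioavailability in rodents, F >25%
        "oral_f_over_25pct": oral_f_pct > 25.0,
    }
```

The same compound can therefore pass the selectivity check for TB and fail it for malaria, which is exactly the asymmetry that BOX 3 describes. 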


Disease-specific hit and lead criteria 
Guided by the specific requirements for each 
disease as well as existing target product and 
candidate profiles, the committee coordinated 
by the GHIT Fund devised disease-specific 
criteria for hits and leads, which are discussed 
below and summarized in BOXES 4-6. 


Malaria. There are several good treatments 
for malaria, but the challenge of emerging 
drug resistance is ever present, particularly 


in South-East Asia where cases of increased 
parasite-clearance times are on the rise in 
patients treated with combination therapies 
that include artemisinin derivatives (for 
example, artesunate). This has revealed a 
new type of Plasmodium spp. resistance — 
essentially one in which the ring stage of the 
intra-erythrocytic cycle can tolerate drug 
intervention. As the mainstay malaria 
treatments rely on a component that is 
artemisinin-based, a global public health 
catastrophe could emerge unless new 
drugs that overcome this risk are delivered. 
In addition, current antimalarials generally 
target the asexual blood stage of the disease. 


Breaking the life cycle and killing parasites 
during other asymptomatic phases — such as 
the sexual-stage gametocytes and liver-stage 
hypnozoites and schizonts — will be crucial 
for providing treatments to block relapse 
in P. vivax infections, preventing transmission 
and protecting vulnerable populations. 
Indeed, only with such drugs can the global 
goal of eradicating malaria be realized. 
Extensive research on these two disease-causing 
pathogens (P. falciparum and P. vivax) over 
the past decade has helped to establish some 
very specific entry criteria for drug discovery 
programmes. 

The cellular potency criterion for a hit 
for malaria research (BOX 4) is based on the 
extensive screening efforts of the MMV and 
its partners over the past 7 years. Through 
partnerships in both industry and academia, 
more than 5 million compounds have been 
screened against the asexual blood stages of 
P. falciparum and the cutoff potency from 
these in vitro screens has mostly been around 
1-2 µM. Nevertheless, more than 25,000 
compound hits were available for follow-up. 
Screening against liver stages of Plasmodium 
spp. has also delivered many potent hits. 
Naturally, activity should be confirmed with 
a pure compound, and hits should have 
selectivity for the parasite over a mammalian cell 
line (for example, a greater than 10-fold 
difference between the IC50 and CC50, the 
half-maximal inhibitory concentration against 
the parasites and the half-maximal cytotoxic 
concentration against the host mammalian 
cells, respectively) and display an acceptable 
concentration response, all of which provide 
confidence in a specific interaction and effect. 

By the start of lead optimization there 
is a need for clarity on the TPPs and TCPs of 
the series, as these define the goal and tactics 
for the subsequent optimization phase. 
The potency required for progression 
depends on the TCP. For blood stages (TCP1 
and TCP2), the crucial aspect is in vivo oral 
efficacy in the P. falciparum SCID (severe 
combined immunodeficiency) mouse 
model of infection, with the key feature 
for TCP1 compounds being rapid parasite 
clearance in vivo, at rates at least as good 
as those of chloroquine. For prophylaxis 
(TCP4), the crucial aspect is in vivo oral 
efficacy in a Plasmodium berghei sporozoite 
challenge model (or equivalent). From 
experience, achieving a dose that eradicates 
90% of the target pathogen (ED90) <50 mg 
per kg provides confidence that the series 
has a parasitological foundation for the lead 
optimization phase; ultimately, a successful 
candidate will have an ED90 <10 mg per 
kg (often considerably lower). Typically, 
this is achievable if a compound has an IC50 
<100 nM against the blood or liver stages 
and appropriate DMPK properties. However, 
a compound may still be acceptable for 
follow up if one of these components does 
not meet the proposed criteria, provided 
the other parameter is outstanding. 

Given the risk of resistance and a need for 
drugs with novel modes of action, potency 
in vitro across a panel of established drug- 
resistant parasite strains isolated from patients 
is also crucial. The genetics underlying the 
emergent resistance against artemisinins, as 
seen in the Mekong area in South-East Asia, 
are becoming increasingly understood, 
and this knowledge is being used to set up 
panels of parasites against which new drug 
candidates can be tested. For TCP1 at least, 
new drugs are most likely to be used in 
combinations (as presently mandated for all 
artemisinins). This means that a newcomer 
must be evaluated for potential drug-drug 
interactions with other TCP1 antimalarials. 

For TCP3a (anti-relapse), it is necessary 
to have in vitro data supporting the activity 
of compounds on hypnozoites, whereas for 
TCP3b activity against mature and, 
ideally, early-stage gametocytes is required in 
addition to asexual blood-stage potency. 
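
The malaria checkpoints summarized in BOX 4 can be sketched as a two-tier classification. The Python below is a minimal illustration under stated assumptions: the function name `malaria_stage` and its return labels are hypothetical, the in vivo ED90 check applies only to the profiles for which BOX 4 gives an ED90 threshold (TCP1, TCP2 and TCP4), and the requirement for activity across resistant strains is not modelled. 

```python
from typing import Optional


def malaria_stage(ec50_um: float, cc50_um: float,
                  ed90_mg_per_kg: Optional[float] = None) -> str:
    """Classify a compound against the numeric thresholds in BOX 4.

    ec50_um         -- EC50 against Plasmodium spp. (µM)
    cc50_um         -- CC50 against a mammalian cell line (µM)
    ed90_mg_per_kg  -- oral ED90 in the relevant mouse model, if measured
    """
    selectivity = cc50_um / ec50_um

    # Validated hit: EC50 <1 µM and >10-fold selectivity
    is_hit = ec50_um < 1.0 and selectivity > 10.0

    # Early lead: EC50 <100 nM, >100-fold selectivity, ED90 <50 mg per kg
    is_lead = (
        ec50_um < 0.1
        and selectivity > 100.0
        and ed90_mg_per_kg is not None
        and ed90_mg_per_kg < 50.0
    )

    if is_lead:
        return "early lead"
    if is_hit:
        return "validated hit"
    return "below hit criteria"
```

A compound without in vivo data can at most be reported as a validated hit here, reflecting the emphasis in the text on oral efficacy before lead optimization begins. 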


Tuberculosis. M. tuberculosis is so well 
adapted to its human host that almost one 
third of the world population is estimated to 
be infected. Although only 10% of infected 
individuals are believed to develop TB 
in their lifetime, this still results in ~1.5 
million deaths annually (see the World 
Health Organization (WHO) website). 
Therefore, there is an urgent need for new 
anti-TB drugs. 

It still takes 6 months to cure drug-sensitive 
TB (DS-TB) and a minimum of 18 months 
to treat multidrug-resistant TB (MDR-TB). 
It is highly desirable to shorten treatment 
duration for both patients with DS-TB and 
those with MDR-TB to improve compliance 
and to limit the spread of drug-resistant TB 
(DR-TB). Standard care of patients with 
DS-TB includes a combination of isoniazid, 
rifampicin, pyrazinamide and ethambutol 
for the first 2 months and a combination of 
isoniazid and rifampicin for the remaining 
4 months. Patients with MDR-TB, whose 
M. tuberculosis strains are resistant to isonia- 
zid and rifampicin, are treated with second- 
line TB drugs that include aminoglycosides, 
quinolone antibiotics, cycloserine and 
capreomycin. An updated dataset was 
recently published for the most commonly 
used TB drugs with respect to in vitro 
potency, cidality, physicochemical and 
pharmacokinetic properties generated 
under standardized conditions. For patients 
treated for MDR-TB, the cure rate is only 
~48% and this needs to be improved 
drastically, whereas the cure rate of 
patients treated for DS-TB is ~85%. 


Box 4 | Summary of main checkpoint criteria for antimalarial hits and leads 

Guided by the specific requirements for the disease, and taking into account existing target product 
and candidate profiles, the committee coordinated by the Global Health Innovative Technology 
(GHIT) Fund devised the following disease-specific criteria for hits and leads for malaria. 

Validated hit 

• Cellular potency criteria: hits should have an effector concentration for half-maximum 
response (EC50) <1 µM for sensitive and multiple resistant strains of Plasmodium spp. 

• Cytotoxicity criteria: hits require a greater than 10-fold selectivity between the half-maximal 
cytotoxic concentration (CC50) for the mammalian cell line and the EC50 for Plasmodium spp. 
(See also BOX 2 for the evaluation criteria regarding the acceptability of chemical structure, 
novelty and confirmation) 

Early lead 

• Cellular potency criteria: a lead requires EC50 <100 nM for sensitive and multidrug-resistant 
strains of Plasmodium spp. 

• Cytotoxicity criteria: a lead should have a greater than 100-fold selectivity between 
mammalian cell line CC50 and Plasmodium EC50. Frontrunners should be tested across the 
malaria life cycle and key mechanistic assays so as to ensure an understanding of the 
phenotype and target candidate profile (TCP) potential of each series and (preferably) novel 
mechanisms of action 

• In vivo efficacy criteria: when administered orally in the blood stages of infection (TCP1 and 
TCP2), a lead should achieve parasite clearance at a dose that eradicates 90% of the target 
pathogen (ED90) <50 mg per kg (typically four doses over 4 days) in the Plasmodium 
falciparum-infected SCID (severe combined immunodeficiency) mouse model. TCP1 should demonstrate a 
rapid rate of parasite clearance. For TCP3a, the anti-relapse TCP, there are no in vivo criteria for 
leads (in the absence of good models), but a lead should demonstrate anti-hypnozoite activity 
in vitro. For TCP3b, the transmission-blocking TCP, a lead should demonstrate potency in a 
gametocyte assay (for example, as measured by gamete formation) in a similar range to that 
of the in vitro asexual blood stage potency. For TCP4, the chemoprotection TCP, a lead should 
have efficacy in a prophylaxis model of malaria, with ED90 <50 mg per kg 

There has been a steady increase in TB 
drug resistance; the WHO estimated that 
3.5% of new cases and 21% of previously 
treated cases are MDR-TB. Among patients 
with MDR-TB, an estimated 9% have extensively 
drug-resistant TB (XDR-TB), meaning 
that the pathogens are resistant to all second-line 
TB drugs in addition to rifampicin and 
isoniazid. A new drug must show efficacy 
against all resistant strains, which calls for 
agents with novel mechanisms. 

The new candidates must also be orally 
available to ensure wide usage, especially in 
developing countries. For the same reason, 
the cost of goods needs to be low and pill 
size (which is determined by potency and 
pharmacokinetic properties) reasonable. 
Furthermore, as they will be used in com- 
bination with other TB drugs to stem the 
emergence of resistance, the new drugs must 
have a low risk for drug-drug interactions. 
As patients with TB are often co-infected 
with HIV, new agents also need to be com- 
patible with most of the anti-retroviral 


drugs. All of these requirements are 
especially stringent because TB drugs are 
administered for extended time spans. 

Treatment shortening can only be achieved 
with an agent or a regimen that effectively 
eliminates non-replicating M. tuberculosis. 
Non-replicating M. tuberculosis is metabolically 
less active and less susceptible to antibiotics 
than actively replicating bacteria, 
and new hits must be evaluated for their 
capacity to kill both replicating and 
non-replicating M. tuberculosis, as measured in 
the microplate Alamar Blue assay (MABA) 
and low oxygen recovery assay (LORA), 
which are both suited for high-throughput 
screening. The desired hit and lead profiles 
are summarized in BOX 5. 

The most recent TB hits were discovered 
in phenotypic screens with cultured 
M. tuberculosis, typically using the MABA. In addition, 
low-oxygen culture conditions (LORA) 
and other assays are used to identify hits 
that kill slow- or non-replicating bacteria. 
However, these assays do not fully capture the 
little-understood mechanisms whereby TB 
bacteria move in and out of latency. 

In addressing this need, a new assay was 
recently developed for agents that kill 
M. tuberculosis bacteria that reside in macrophages, 
resulting in the discovery of Q208 (REF. 46). 
Box 5 | Main checkpoint criteria for anti-tuberculosis hits and leads 

Guided by the specific requirements for the disease, and taking into account existing target product 
and candidate profiles, the committee coordinated by the Global Health Innovative Technology 
(GHIT) Fund devised the following disease-specific criteria for hits and leads for tuberculosis (TB). 

Validated hit 

• Cellular potency criteria: a hit should have a minimum inhibitory concentration (MIC) against 
the Mycobacterium tuberculosis laboratory strain H37Rv of <10 µM under replicating growth 
conditions, for example, under microplate Alamar Blue assay (MABA) conditions 

• A hit requires a greater than 10-fold difference between cytotoxicity against mammalian cells 
(Vero cells) and the MIC against M. tuberculosis H37Rv 

• Hits should show evidence of a preliminary structure-activity relationship (SAR) among analogues 

(See also BOX 2 for the evaluation criteria regarding the acceptability of chemical structure, novelty 
and confirmation) 

Early lead 

• In vitro potency criteria: a lead should have a MIC against the M. tuberculosis strain H37Rv of 
<1 µM under MABA conditions and a MIC <10 µM under non-replicating anaerobic conditions 
(low-oxygen-recovery assay, LORA conditions) 

• A lead should exhibit in vitro activity against M. tuberculosis strains that are resistant to a single 
TB drug, such as isoniazid or rifampicin, indicating a new mechanism of action 

• A demonstration of the in vitro bactericidal activity of a lead is required as indicated by 
time-kill curves showing that the number of colony-forming units (CFUs) decreases over time 
(for example, 14 days) compared with the number of CFUs at the beginning of the experiment 

• The oral bioavailability of a lead must be demonstrated in rodents 

• A demonstration of the oral efficacy of a lead in a mouse acute infection model is required 

• A preliminary indication of the safety of a lead must be demonstrated (in hERG and cell 
health assays) 
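
The potency and selectivity thresholds in BOX 5 can be expressed in the same way. The sketch below is illustrative only: `tb_stage` and its return labels are hypothetical, and it covers just the MIC and cytotoxicity numbers. The remaining lead requirements (activity against resistant strains, time-kill curves, oral bioavailability, oral efficacy and early safety) need experimental evidence that a threshold check cannot stand in for. 

```python
from typing import Optional


def tb_stage(mic_maba_um: float, cc50_vero_um: float,
             mic_lora_um: Optional[float] = None) -> str:
    """Classify a compound against the numeric thresholds in BOX 5.

    mic_maba_um   -- MIC against M. tuberculosis H37Rv under MABA conditions (µM)
    cc50_vero_um  -- cytotoxicity against Vero cells (µM)
    mic_lora_um   -- MIC under non-replicating LORA conditions (µM), if measured
    """
    # Greater than 10-fold window between cytotoxicity and MIC
    selective = cc50_vero_um / mic_maba_um > 10.0

    # Validated hit: MIC <10 µM under replicating conditions
    is_hit = selective and mic_maba_um < 10.0

    # Early lead potency: MIC <1 µM (MABA) and MIC <10 µM (LORA)
    has_lead_potency = (
        selective
        and mic_maba_um < 1.0
        and mic_lora_um is not None
        and mic_lora_um < 10.0
    )

    if has_lead_potency:
        return "meets lead potency criteria"
    if is_hit:
        return "validated hit"
    return "below hit criteria"
```

Requiring a LORA value for the higher tier mirrors the point made in the text that treatment shortening depends on activity against non-replicating bacteria. 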


The second challenge in discovering drugs 
that target latent TB requires an understanding 
of TB pathogenesis. M. tuberculosis primarily 
infects and replicates in activated macrophages 
but can persist in a non-replicating 
state in foamy macrophages. This host-pathogen 
system can form granulomas in 
which infected macrophages are surrounded 
by layers of active macrophages and other 
immune cells. The central granuloma region is 
oxygen-deprived and necrotized but contains 
latent M. tuberculosis; in the lung this results 
in cavities. The inner necrotic area is poorly 
irrigated, complicating local drug exposure. 
Granulomas do not normally form in in vivo 
models; however, recently, animal models 
of granulomas were developed. In one of 
these models, it was shown that treatment 
with a vascular endothelial growth factor 
(VEGF)-specific antibody can restore vasculature 
and drug access in TB granulomas. 
Continued research into new assays and models 
is crucial, as well as pursuing completely 
different approaches that aim to stimulate host 
immune responses against TB. 


Visceral leishmaniasis and Chagas disease. 
For visceral leishmaniasis, existing drugs have 
variable efficacy and serious toxicities; only 
one (miltefosine) is administered orally and 
the others are given by intravenous or intra- 
muscular injections, which are impractical. 


The goal for research must be to transform 
patient therapy from poorly adapted anti- 
monial treatments (for example, sodium 
stibogluconate (SSG)) to simple, patient- 
adapted oral therapies that are affordable, safe 
and efficacious in both children and adults. 
In the past 15 years, combinations of 
drugs with similar or improved efficacy to 
the older antimonials, but with improved 
safety and tolerability profiles, have been 
developed. These include combinations 
containing liposomal amphotericin B, paromomycin 
and/or miltefosine. However, 
these drugs remain costly, are difficult to 
administer, have poor stability at the high 
temperatures that occur in endemic regions, 
require lengthy treatment and/or are poorly 
tolerated. In addition, there is a dichotomy 
in drug efficacy in regions of the world 
where visceral leishmaniasis is endemic. 
In South Asia, the medical needs for visceral 
leishmaniasis are presently moderately to 
well met. However, in East Africa and Latin 
America, the efficacy and tolerability of 
current visceral leishmaniasis therapies 
remain a challenging area for improvement. 
Ideally, what is needed in the treatment 
of visceral leishmaniasis is a simple oral 
combination therapy that would prove to be 
advantageous and/or effective in maintaining 
or improving efficacy, improving tolerability 
and preventing or delaying the emergence of 


resistance. Furthermore, a treatment adapted 
to field conditions, with a shorter treatment 
duration and that could be used globally, 
would be optimal. In addition, to address 
the development of resistance by the parasite 
to drug monotherapies (as documented 
for current therapies), drugs with new and 
distinct mechanisms of action should be 
developed as combination therapies early in 
development and not used as monotherapies. 

In the case of Chagas disease, there are 
even fewer treatment options than there are 
for visceral leishmaniasis. Monotherapy with 
nifurtimox or benznidazole (both from 
the same nitroheterocycle class) remains the 
only recognized treatment, but these agents 
require long treatment courses, have variable 
efficacy and cause serious side effects, 
resulting in discontinuation of treatment in 
~20-30% of patients. It is crucial that new 
classes of effective, well-tolerated, orally acting 
and short-course treatments are progressed 
into the clinic to provide improved options 
for patients. New classes of drugs for Chagas 
disease are also essential to enable the devel- 
opment of combination therapies to improve 
efficacy and toleration, reduce treatment dura- 
tion and combat the risk of the development 
of resistance to monotherapies. Recently, a 
trial in chronic Chagas disease was described 
for posaconazole, an inhibitor of T. cruzi 14-α 
demethylase (CYP51) that has a different 
mechanism of action to the currently used 
nitroheterocyclics, as exemplified by benznidazole. 
In spite of good preclinical efficacy, 
the new compound did not meet expectations, 
resulting in more treatment failures than 
benznidazole. It is believed that the standard 
Chagas mouse model that was used to validate 
posaconazole only represents the early acute 
phase of infection; thus, it is important to use 
models and assays that also allow testing of 
the chronic late stage of the disease. 

In order to promote the discovery of safe, 
efficacious and orally acting treatments for 
visceral leishmaniasis and Chagas disease 
that overcome the limitations of existing 
regimens, discovery needs to focus on new 
chemical series, leaving behind the flawed 
classes in current use. Until recently, the 
steady identification of new chemical series 
has been hindered by a limited screening 
capacity coupled with very low hit rates. 
However, progress has been made and 
larger compound collections are now being 
screened. More than a million compounds 
have been screened against T. cruzi and 
Leishmania spp., but the rate of hit discovery 
still lags far behind that for malaria. 
It remains necessary to avoid discarding 
precious chemical series by setting the hit 
selection criteria too high. Thus, hit criteria 
focus on the identification of new series 
and mechanisms of action with even 
modest activity against the intracellular 
forms of Leishmania donovani and T. cruzi, 
the causative agents of leishmaniasis and 
Chagas disease, respectively. 

Although this inclusive approach to hit 
selection provides scope for subsequent 
HTL projects, it does bring with it a high 
rate of early attrition and often leads to long 
optimization campaigns before quality leads 
are produced. The HTL entry criteria stated 
in BOX 6 are a combination of the current 
well-documented ‘best practices’ for targeted 
drug discovery projects and a modest 
phenotypic in vitro activity hurdle. Once 
the encouraging in vitro anti-parasitic activity 
of a new series can be coupled with sufficient 
in vivo plasma exposure, ideally following 
oral dosing, the next hurdle is to demonstrate 
a robust reduction in the parasite burden at 
target organs in infected rodents, which is the 
defining characteristic of a lead that is ready 
for subsequent optimization. Unfortunately, 
we have little knowledge regarding how 
in vitro potency correlates with in vivo activity 
in visceral leishmaniasis and Chagas 
disease, with a lack of pharmacokinetic and 
pharmacodynamic (PK/PD) examples. For 
T. cruzi infection, oral dosing is required for 
in vivo proof-of-concept studies, which is in 
line with the requirement for orally acting 
therapies. Although oral dosing is also pre- 
ferred for studies with rodents infected with 
Leishmania spp., intraperitoneal or intra- 
venous administrations are also acceptable 
given that a short-course treatment adminis- 
tered by the parenteral route could fulfil the 
TPP for visceral leishmaniasis. 
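
The in vivo lead hurdles for Chagas disease and visceral leishmaniasis (BOX 6) reduce to a percentage reduction in parasite burden under a capped dosing regimen. The following Python sketch shows one way to compute that; the names `percent_reduction`, `LEAD_RULES` and `meets_lead_efficacy` are hypothetical, the burden values are assumed to be any consistent measure from untreated and treated animals, and treating doses below 50 mg per kg as acceptable is an assumption of this sketch rather than something BOX 6 states. The alternative Chagas criterion (no parasites detected plus increased lifespan) is not modelled. 

```python
def percent_reduction(control_burden: float, treated_burden: float) -> float:
    """Percentage reduction in parasite burden relative to untreated controls."""
    return 100.0 * (1.0 - treated_burden / control_burden)


# Thresholds from BOX 6. "strict" distinguishes ">70%" (visceral
# leishmaniasis) from "an 80% reduction" (Chagas disease).
LEAD_RULES = {
    "chagas": {"min_reduction_pct": 80.0, "strict": False, "max_doses": 10},
    "visceral_leishmaniasis": {"min_reduction_pct": 70.0, "strict": True, "max_doses": 5},
}


def meets_lead_efficacy(disease: str, control_burden: float, treated_burden: float,
                        doses: int, dose_mg_per_kg: float) -> bool:
    """Check the oral in vivo efficacy hurdle for a lead in BOX 6."""
    rule = LEAD_RULES[disease]
    reduction = percent_reduction(control_burden, treated_burden)
    if rule["strict"]:
        reduction_ok = reduction > rule["min_reduction_pct"]
    else:
        reduction_ok = reduction >= rule["min_reduction_pct"]
    # Assumption: a regimen at or below 50 mg per kg and within the
    # dose-count cap is taken to satisfy the dosing part of the criterion.
    return reduction_ok and doses <= rule["max_doses"] and dose_mg_per_kg <= 50.0
```

For example, a treated burden of 150 against a control burden of 1,000 corresponds to an 85% reduction, which clears the Chagas threshold if achieved within 10 oral doses. 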


Box 6 | Hit and lead criteria for Chagas disease and visceral leishmaniasis 

Guided by the specific requirements for the disease, and taking into account existing target 
product and candidate profiles, the committee coordinated by the Global Health Innovative 
Technology (GHIT) Fund devised the following disease-specific criteria for hits and leads for 
Chagas disease and visceral leishmaniasis. 

Hit and lead criteria for Chagas disease 

• Key hit selection criteria (cellular potency): a hit should have a half-maximal inhibitory 
concentration (IC50) <10 µM against intracellular Trypanosoma cruzi 

• Key lead selection criteria: in an acute mouse model of Chagas disease, a lead should demonstrate an 
80% reduction of parasite burden in organs or tissues, or there should be no parasites detected at 
the end of treatment and an increase in lifespan with up to 10 doses at 50 mg per kg delivered orally 

Hit and lead criteria for visceral leishmaniasis 

• Key hit selection criteria (cellular potency): a hit should demonstrate an IC50 <10 µM against 
intracellular Leishmania donovani 

• Key lead selection criteria: in a mouse (or hamster) model (infected with L. donovani or 
Leishmania infantum), treatment with a lead should result in a >70% reduction in liver parasite 
burden after at most 5 doses at 50 mg per kg delivered orally once or twice per day 


Discussion 

Our consultation among drug discovery 
experts collaborating with the GHIT Fund 
revealed differences in the criteria considered 
most conducive to cost-efficient drug 
discovery, which are associated with the specific 
requirements for the diseases reviewed 
here. One of the dangers in establishing 
stringent, clear-cut criteria for entry into the 
HTL phase and then progression into lead 
optimization is that valuable chemicals are 
discarded at an early stage. It is obvious that 
a molecule such as artemisinin, were it to 
emerge today as a hit in a high-throughput 
screening campaign, would severely struggle 
to be taken forward, scoring extremely low 
in chemical tractability, ease of synthesis 
and chemical suitability (the compound 
has a highly reactive peroxide). Even if it 
were taken further (without analysis of the 
structure-activity relationship (SAR)), 
scientists would discover that the compound 
has a very short in vivo half-life (minutes), is 
metabolically unstable and that its mode of action 
and target are poorly understood. Along 
with serious preclinical safety concerns, 
there is every reason to believe that the 
modern drug discovery process would have 
discarded the drug long before it had a chance 
to save the lives of millions. In fact, algorithms 
that select ‘drug-like’ molecules generally 
exclude artemisinin-like molecules from the 
chemical libraries used for HTS in the first 
place. Chemical tractability is important, but 
it must be kept in mind that there are many 
other molecules that have progressed straight 
from a phenotypic screen (that is, without 
further chemical modification) into patients, 
such as tamatinib (also known as R406), 
paclitaxel, rapamycin and cyclosporine. 
Conversely, there are numerous examples 
in which chemical series were pursued far 
too long (in hindsight), soaking up valuable 
resources, time and careers that should have 
been invested elsewhere. As an additional 
complication in navigating between these two 
extremes, the TPPs may shift with changes in 
the clinical landscape for infectious diseases 
that affect people in the developing world; for 
example, such changes may include a shift in 
policy from curing patients ad hoc to eradi- 
cation programmes that require mass drug 
administration, a decision to focus on the 
prevention of transmission rather than 
on cures, changing disease priorities and 
pharmacoeconomics, the emergence of 
co-infections (with concomitant drug-drug 
interaction risks) or the spread of resistance. 
Nevertheless, in the midst of all these com- 
plexities, project teams can attempt to steer 
their compounds using the set of criteria and
considerations presented here in the light of
the four sets of disease TPPs as beacons. 

Our analysis has split recommendations for 
entry into HTL and lead optimization phase 
into a generic part and a more specific part 
for each disease. As stated earlier, these guide- 
lines were established in the context of GHIT 
Fund-coordinated collaborations, but we 
hope that they will find wider adoption and 
aid the acceleration and expansion of pipeline 
drug candidates for these serious diseases. 
COMMENT


A community-based approach 
to new antibiotic discovery 


Matthew A. Cooper 


There has been much written on the ‘superbug crisis’ 
over the past decade. Commonly cited causes include 
the over-use of antibiotics in humans and animals and 
the exit of many companies from antibacterial drug 
research and development (R&D) owing to the challenging
market economics. However, there is another
important cause — it is simply getting more and more 
difficult to find new antibacterial drugs. Most of the 
current arsenal of antibacterial drugs originated from 
screening of natural products in the 1950s-1970s. 
However, the ‘low-hanging fruit’ from this strategy 
appears to have been exhausted, and one alternative
(target-based screening using knowledge of bacterial
genomics) has failed to replenish the drug pipeline.
Consequently, there has been a revival in interest in 
establishing antimicrobial screening platforms to inter- 
rogate previously untapped sources of chemical diversity. 


New sources of chemical diversity 

One way to gain access to new chemical diversity that 
could be highly relevant for antibacterial drug dis- 
covery is to isolate novel natural products produced by 
microbes; for example, from bacteria that have previ- 
ously been ‘unculturable’. However, when a new natural 
product with antibacterial activity is found, it is not easy 
to isolate and scale up the production of sufficient mate- 
rial needed for preclinical development. Furthermore, 
antibiotic-resistance mechanisms to a novel natural 
product will presumably already be present in the 
bacterial community, given the need for the organism 
producing the antibiotic to protect itself against the 
effects of the antibiotic and the defensive mechanisms 
evolved by other bacteria to the agent. 

The good news is that we have already made many 
other molecules that have not been encountered by 
microbes before, and that have yet to be tested for antimicrobial
activity. At the time of writing, 94 million compounds
had been deposited in the Chemical Abstracts
Service (CAS) Registry; 80 million of these were organic
compounds with no metal ion and a molecular mass
of <1,500 Da. Chemists around the world synthesize
thousands of new organic compounds each year, but
most of these compounds never leave their laboratories.
Individually, such compounds may have little value, but
taken collectively as part of a ‘global collection’ of unusual
synthetic molecules, they could be much more useful.

The Community for Open Antimicrobial Drug Discovery aims to tap into the potential of the
millions of compounds distributed around laboratories globally to be a source of new antibiotic
leads by offering free screening for antimicrobial properties, with no strings attached.

Chemists — and academic chemists in particular — 
make molecules for many reasons. Most chemists do 
not work in drug design, but are instead interested in the 
development of new synthetic methods and novel struc- 
tures, and so are unconstrained by concerns about ‘drug- 
like’ properties. By contrast, pharmaceutical companies 
hold large collections of compounds that are curated to 
maximize the likelihood of oral bioavailability, and biased 
towards major human target families, such as G protein- 
coupled receptors and kinases. Furthermore, antibiotics 
occupy very diverse chemical space, and many are ‘suicide 
inhibitors’ of microbial enzymes and often possess reactive
groups that are normally filtered out in corporate
collections. 

So, how many of the 80 million organic compounds in 
the CAS Registry might have the potential to be starting 
points for new antibacterial drugs? Applying a simple filter 
for antibacterial-like properties (calculated logP between
-10 and 2, and molecular mass < 1,200 Da) to this group 
yielded 29 million compounds. The majority of these 
compounds are not commercially available, and most will 
never have been screened for antibacterial activity. Impor- 
tantly, however, any compounds within this subset would 
have a known synthetic route, facilitating the scaling up 
of production of validated hits for further study and the 
exploration of analogues with improved properties. 
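
As an illustration of the property filter described above, the following Python sketch applies the two stated cut-offs (calculated logP between -10 and 2, molecular mass below 1,200 Da) to a list of compound records. It is an assumption-laden sketch rather than the procedure actually used on the CAS Registry: the `Compound` type and function names are hypothetical, the logP bounds are treated as inclusive, and the property values are expected to be pre-computed by some cheminformatics tool.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Compound:
    """Hypothetical compound record with pre-computed properties."""
    name: str
    clogp: float        # calculated logP
    mol_mass_da: float  # molecular mass in daltons


def is_antibacterial_like(
    compound: Compound,
    clogp_min: float = -10.0,
    clogp_max: float = 2.0,
    max_mass_da: float = 1200.0,
) -> bool:
    """Return True if the compound passes the simple property filter.

    Assumption: the logP bounds are inclusive; the mass cut-off is strict,
    following the "< 1,200 Da" wording.
    """
    return (
        clogp_min <= compound.clogp <= clogp_max
        and compound.mol_mass_da < max_mass_da
    )


def filter_antibacterial_like(compounds: Iterable[Compound]) -> List[Compound]:
    """Keep only the compounds that pass the filter, preserving input order."""
    return [c for c in compounds if is_antibacterial_like(c)]
```

Keeping the cut-offs as keyword arguments makes it easy to see how sensitive the size of the surviving subset is to each threshold.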

Given these initial considerations, the key question 
is: are there novel, high-quality antimicrobial hits among 
the millions of synthetic organic compounds distributed 
around laboratories globally? In 2015, we launched the 
Community for Open Antimicrobial Drug Discovery 
(CO-ADD), which is funded by the Wellcome Trust and 
The University of Queensland, to engage the worldwide 
chemistry community in an effort to answer this question. 
A community approach: CO-ADD

CO-ADD is an open-access facility, in that any chemist 
in the world can have their compounds tested for anti- 
microbial potency and cytotoxicity, free of charge and 
with no encumbrance on intellectual property (IP). 

Compounds submitted to CO-ADD undergo a pri- 
mary screen in a 384-well format against representatives 
of 5 key bacterial pathogens (Escherichia coli, Klebsiella
pneumoniae, Acinetobacter baumannii, Pseudomonas aeruginosa
and methicillin-resistant Staphylococcus aureus
(MRSA)), as well as the fungi Cryptococcus neoformans and
Candida albicans. Membrane-deficient and efflux pump-
impaired E. coli mutants are also assayed to enhance 
our knowledge surrounding antibiotic activity, pene- 
tration and inactivation. Subsequent hit confirmation 
includes dose-response antimicrobial assays, as well as 
counter-screening for adverse effects (using cytotoxicity, 
critical micelle concentration and membrane disruption 
assays), which is then followed by a cascade of assays 
for hit validation. Here, the hit compounds are assayed 
against a broader panel of multidrug-resistant clinical 
isolates and reference strains and tested in the presence 
of serum and lung surfactant, and then assessed for 
blood haemolysis, microsomal and plasma stability and 
drug-protein binding. 

Together, these data support nomination of a vali- 
dated hit and form the basis for optimization towards 
a candidate antibiotic. At this stage, it is not possible to 
conduct lead optimization for free. However, such activi- 
ties could potentially be funded through schemes such as 
the Wellcome Trust Seeding Drug Discovery awards, the 
US National Institute of Allergy and Infectious Diseases 
R21/R33 awards and the European Innovative Medicines 
Initiative ENABLE programme. 

One key aspect of the CO-ADD programme is the 
use of standardized testing based on well-defined, con- 
sistent protocols. Many other fields of drug discovery 
use standard assays and instrumentation; however, in 
antimicrobial discovery, a multitude of assays (including 
disc diffusion, Etest and broth or agar dilution assays), 
strains, media and plate types are employed, meaning 
a direct comparison of results between laboratories is 
challenging. Clinical and Laboratory Standards Institute 
(CLSI)-compliant, standardized conditions and refer- 
ence strains will be used throughout the programme, so 
that researchers will finally be able to compare thousands 
of compounds assayed under identical conditions. As 
more compounds are screened, the collated data, which 
will be made publicly available for both active and non- 
active compounds, will become increasingly valuable 
for gaining understanding not just of novel chemotypes 
associated with antibacterial activity, but also of char- 
acteristics that influence other key properties, such as 
penetration into bacterial cells and avoidance of efflux. 

CO-ADD is a hybrid of existing successful models: 
those that provide access to facilities (for example, 
GlaxoSmithKline’s Open Lab Foundation at Tres Cantos, 
and Eli Lilly’s Open Lab), and discovery consortia (such 
as the US National Institutes of Health Molecular 
Libraries Program, the Tropical Disease Initiative, 
Collaborative Drug Discovery and EU-OPENSCREEN). 


CO-ADD is differentiated from these by its specific focus 
on antimicrobial screening and discovery, with an 
extremely low barrier to participant entry. There are no 
limitations on geography, nor expert panels that review 
molecules for ‘suitability’ to screen. The approach is 
fully inclusive — if the compound is soluble in water or 
dimethyl sulfoxide, not radioactive, not pyrophoric and 
not an illicit drug, CO-ADD will screen it. 


Challenges and outlook 
In the golden age of antibiotic discovery, most scientists 
would collaborate openly with each other. Now university 
technology transfer offices are reluctant to allow research- 
ers to transfer compounds, assays or reagents without 
non-disclosure agreements and material transfer agree- 
ments (MTAs), which can take many months to agree — 
even for limited, short experiments. CO-ADD provides 
simple terms and conditions online and a signature-free 
MTA download if needed, which together warrant that 
the provider of the compound retains all rights to the 
compound, assay results and IP. Furthermore, research- 
ers are initially only asked to provide the molecular mass 
of the compound, and a structural fingerprint if possible. 
CO-ADD participants have 18 months to patent and 
develop their compound before they are asked to make 
structures and results available to the open-access database, 
and if 18 months is too short, this period can be extended 
on request. Hopefully, this compromise will lead to a valu- 
able repository of structure-activity and structure-toxicity 
relationships, while allowing sufficient time for IP related 
to promising compounds to be obtained in order for 
commercially focused clinical development to progress. 
There is also a practical barrier to entry — the simple 
act of weighing out and shipping a compound. Here, 
the CO-ADD kick-start programme provides financial 
incentives to labs to defray costs associated with sample 
collection and shipping. Our outreach strategy is raising 
awareness and building a network of advocates through 
conferences and presentations globally, and the pro- 
gramme has been received with enthusiasm by individuals, 
laboratories and university and institute collections. 
Very few completely new classes of antibacterial or 
antifungal drugs have been approved in the past two 
decades. CO-ADD is just a small part of what we hope 
will be a renaissance in new R&D and business models 
and technologies that together can provide a sustainable 
solution to the superbug crisis. 
Pioneering government-sponsored
drug repositioning collaborations: 
progress and learning 


Donald E. Frail, Madeleine Brady, K. Jane Escott, Alison Holt, 
Hitesh J. Sanganee, Menelas N. Pangalos, Chris Watkins and Craig D. Wegner 


Abstract | A new model for translational research and drug repositioning has
recently been established based on three-way partnerships between public 
funders, the pharmaceutical industry and academic investigators. Through two 
pioneering initiatives (one involving the Medical Research Council in the United
Kingdom and one involving the National Center for Advancing Translational
Sciences of the National Institutes of Health in the United States), new
investigations of highly characterized investigational compounds have been 
funded and are leading to the exploration of known mechanisms in new disease 
areas. This model has been extended beyond these first two initiatives. Here, we 
discuss the progress to date and the unique requirements and challenges for
this model.


Drug repositioning — also referred to as 
repurposing, reprofiling, rescue, or indica- 
tions discovery — is the process of identifying 
a new use for an existing drug or drug
candidate in an indication outside the scope 
of the original indication. The case was 
made a decade ago that drug repositioning 
is one strategy to address the declining pro- 
ductivity of the pharmaceutical industry, as 
starting with a well-characterized compound 
decreases the duration of clinical develop- 
ment and could reduce attrition owing to 
issues such as poor pharmacokinetics
or insufficient safety. Furthermore,
well-characterized clinical-stage compounds
can be used to investigate novel disease
hypotheses in human studies, which is
important given the limitations of studies
in animal models, including the lack of
predictivity of the efficacy of compounds
in subsequent clinical trials and issues with
reproducibility (see the Nature focus on 
challenges in reproducible research). Making 
such compounds more widely available helps 
to drive hypothesis creation and can lead to 
broader testing in humans. 

Multiple collaborations have since been 
established to enable the investigation of 
scientific advances within academia using 
drugs and drug candidates from industry. 
For example, in 2010, Pfizer’s Indications 
Discovery Unit and the Washington 
University School of Medicine formed 
a partnership that provided access to
proprietary data for a large portfolio of
active and discontinued Pfizer drug candidates
and allowed investigators to propose
and collaborate on preclinical or clinical
studies to investigate new uses. Around
the same time, the UK Medical Research 
Council (MRC) was seeking to enable a 
greater understanding of the mechanisms 
of human disease through experimental 
medicine studies enabled by access to 
high-quality compounds from industry. 
Although this strategic intent is distinct 
from that of a drug repositioning effort,
the two complement each other. Meanwhile,
Francis Collins, Director of the US National
Institutes of Health (NIH), advocated
for a more comprehensive repositioning
programme that harnessed the strength of
various stakeholders. The NIH convened a
roundtable of leading representatives from
academia, the US government and private
sector research and development (R&D)
to explore such strategies, including one in
which pharmaceutical companies would
create a pool of compounds for further
investigation by academia through a government
grant programme. Thus, in 2011
the MRC, in partnership with AstraZeneca, 
implemented the Mechanisms for Human 
Diseases Initiative, and in 2012 the NIH, 
through the National Center for Advancing 
Translational Sciences (NCATS), imple- 
mented the Discovering New Therapeutic 
Uses for Existing Molecules initiative. 


PERSPECTIVES 


The announcements of these 
programmes created some controversy, 
with some questioning the role of the NIH 
and NCATS as drug developers and the 
value of drug repositioning (for example, 
see REF. 6 and further discussion below). 
Given the timeframes of drug development, 
it may be several more years before the full 
value of these programmes is clear. Here, we 
make an interim assessment of the progress 
to date, discussing the unique requirements 
and challenges encountered with these 
programmes, as well as early indicators 
of success. 


Programme characteristics 

The MRC Mechanisms of Human Disease 
Initiative. This programme, a partnership 
between the MRC and AstraZeneca that 
was launched in 2011, provided academic 
researchers with unprecedented access
to a high-quality collection of clinical and
preclinical AstraZeneca compounds in
order that they could propose new research
into human disease mechanisms and
the development of potential therapeutic
interventions. The two partners had
different but complementary motives.
For the MRC, the initiative supported the
MRC Translational Research Strategy, 
which has a strong emphasis on experi- 
mental medicine research to understand 
the biology of human disease and includes, 
where appropriate, preclinical studies 
using model systems. Successful preclinical 
studies could stimulate further pursuit of 
mechanistically related compounds in the 
clinic. The development of potential thera- 
peutic interventions was not a primary goal, 
but it was hoped that successful studies and 
an increased understanding of the mecha- 
nisms involved in human disease, which 
would go into the public domain via peer- 
reviewed publications, would lead to the 
development of new medicines for patients 
by AstraZeneca or other companies. For 
AstraZeneca, the programme also provided 
the opportunity to build stronger relation- 
ships with members of the UK academic 
community with knowledge across a broad 
range of diseases. 

Criteria were established for identifying 
which AstraZeneca compounds could be 
made available for external research.
The focus was on identifying either development
compounds that were discontinued
and considered suitable for additional clinical 
studies (including confidence that target 
engagement is achievable), which were 
offered for clinical and preclinical proposals, 
or development compounds that were no 
longer deemed suitable for clinical studies
based on available clinical data (for example,
lack of target coverage or sufficient safety)
or limitations identified preclinically
(for example, emerging preclinical long-term
safety data), which were offered for 
preclinical proposals only. 

AstraZeneca filtered through over 450 
compounds that had been nominated for 
clinical development and identified a total of 
22 suitable compounds (see Supplementary 
information S1 (table)). These compounds 
had been extensively characterized, including 
their potency, selectivity, pharmacology, 
complete pharmacokinetic and safety 
packages, target engagement and previous 
use in humans. For each compound, internal 
data were reviewed and summaries of the 
most relevant information were developed 
to enable investigators to craft a suitable 
proposal (see Supplementary information S2
(table)). This information was posted on the
MRC website (that is, in the public domain)
to attract the most innovative ideas from
MRC-eligible investigators. At the time,
this compendium of information was the
largest single public source of what was 
proprietary information regarding efficacy, 
safety and other information on discontinued 
development compounds. 

The MRC initiated a call for ‘concept 
proposals’, directing investigators to the
compounds on the website. Over the 8-week 
open call period, more than 100 proposals 
were submitted from 37 different UK institu- 
tions, across all compounds and spanning a 
range of disease areas. The geographic diver- 
sity and breadth of ideas indicated that the 
concept of crowdsourcing was successful. 

About half of the proposals fell within 
disease areas of focus for AstraZeneca, 
with the remainder falling into areas outside 
core therapeutic areas of interest, although 
whether the proposal fell within an area of 
interest to AstraZeneca was not a criterion 
used for review. A committee comprised of 
senior UK scientists convened by the MRC 
provided an initial review of the proposals 
for scientific merit, and independently, 
AstraZeneca provided a review of both the 
feasibility and suitability of the compound 
for the proposed investigations and the 
novelty of the studies. AstraZeneca was 
able to review these proposals owing to a 
confidentiality disclosure agreement (CDA) 
between AstraZeneca and the MRC, which 
covered these activities. 

Informed by these reviews, a joint 
MRC-AstraZeneca committee then identi- 
fied 25 proposals to advance to full proposal. 
The full proposals were collaboratively
developed by the UK investigators and
AstraZeneca scientists, and a champion of 
the proposal from AstraZeneca was required 
to proceed. Given the innovative funding 
approach taken, the collaboration between 
UK investigators and AstraZeneca scientists 
was essential for the development and 
pursuit of highly competitive and scientifi- 
cally robust proposals. The full proposal 
development process was intense, with 
some complex clinical investigations being 
considered within short timelines. In all 
cases, AstraZeneca researchers were named 
co-applicants on the proposals, which were 
led by an academic. The full applications 
were considered by a specially convened 
MRC committee who were informed by 
international peer review comments; there 
was no AstraZeneca representation in the 
committee to ensure funding decisions were 
based solely on scientific merit and feasibility, 
without any commercial bias. 

In total, 15 collaborative proposals — 
7 preclinical and 8 clinical — were funded by 
the MRC and are now underway (TABLE 1). 
The collection of studies is highly diverse 
and each study is for a different indication, 
ranging from common disorders 
(for example, Alzheimer disease) to orphan 
diseases (for example, muscular dystrophy) 
and including indications outside of 
AstraZeneca’s core areas of focus. Funding for
the studies is provided to the investigators 
by the MRC (no MRC funds are provided to 
AstraZeneca). AstraZeneca, in addition 
to providing collaborative insight and sug- 
gestions to the full proposals, is responsible 
for providing the necessary drug supply and 
documents to support the regulatory and 
ethics committee filings by the investigators, 
as well as coordination of any adverse events 
when a compound is the subject of more 
than one study. The clinical studies are 
sponsored by the investigator, who is there- 
fore ultimately responsible for the conduct 
of the study. 


NIH-NCATS Discovering New Therapeutic 
Uses for Existing Molecules. The pilot 
programme from NCATS was initiated
in 2012 (see Further information). This
programme matched NIH-funded researchers
with a selection of 58 compounds previously
discontinued from development
(see Supplementary information S1
(table)). In this case, multiple companies
were involved, with Abbott (supplying 3 
compounds), Bristol-Myers Squibb (3), 
GlaxoSmithKline (4), Johnson & Johnson (3) 
and Sanofi (10) joining the founding com- 
panies of AstraZeneca (14), Eli Lilly (4) and 


Pfizer (17). Although similar in essence
to the MRC programme, there were also
substantial differences (TABLE 2):

• Purpose of the programmes. The primary
goal of the MRC programme was to
investigate mechanisms of human disease.
Therefore, the MRC programme
included preclinical studies, early
concept-testing human studies and/or
Phase II proof-of-concept studies.
By contrast, the NCATS programme
required the full proposal to contain
a statistically powered Phase II
proof-of-concept study. Neither the MRC
programme nor the NIH programme
was a global programme: each targeted
only eligible investigators in the
United Kingdom or the United States,
respectively, although both were essential
in building the foundation for a global
programme as is discussed below (see
also the AstraZeneca Open Innovation
website).

• Compound criteria. The NCATS
programme required compounds to have
prior evidence of target coverage and
manageable tolerability in humans, to
potentially enable confident hypothesis
testing in a new indication. The MRC
programme also included compounds
suitable for only preclinical use with evidence
of potency, selectivity and exposure
(typically by the oral route) in preclinical
models. In both programmes, compounds
were no longer in active development
(that is, they were discontinued)
to avoid any concern that public funding
might be supplementing an ongoing
commercial development objective.

• Review process. In the NCATS programme,
the review of the concept proposals did
not involve the companies, and proposals
that were not selected for full proposal
were not seen by the industry partners.
The pharmaceutical companies were
engaged under a CDA for those proposals
selected for full development and could
deny support of the proposal, thereby
preventing full proposal submission.
As with the MRC programme, the final
funding decisions were made without
company input.

• Template agreements. NCATS required
each company to prepare and publicly
post online template CDAs and
collaborative research agreements
(CRAs) at the time of the announcement
of the programme, enabling early review
by technology transfer offices and rapid
implementation of such agreements.
In addition, NCATS required a signed


834| DECEMBER 2015 | VOLUME 14 


© 2015 Macmillan Publishers Limited. All rights reserved 


www.nature.com/reviews/drugdisc 




Table 1 | Government-sponsored collaborative drug repositioning projects

MRC-funded projects

| Project focus* | Collaborators, institution and industry partner | New indication (original indication) | Type of project |
|---|---|---|---|
| Saracatinib (AZD0530) as a novel analgesic for cancer-induced bone pain | Dr D. Andrew, University of Sheffield, UK; AstraZeneca | Cancer-induced bone pain (solid tumour) | Preclinical and clinical |
| Assessing the therapeutic efficacy of an 11βHSD1 inhibitor (AZD4017) in idiopathic intracranial hypertension | Dr A. Sinclair, University of Birmingham, UK; AstraZeneca | Idiopathic intracranial hypertension (diabetes and obesity) | Clinical |
| Evaluation of the selective endothelin A-receptor antagonist zibotentan (AZD4054) as a treatment for renal disease in systemic sclerosis (scleroderma) | Professor C. Denton, University College London, UK; AstraZeneca | Renal scleroderma (prostate cancer) | Clinical |
| Phase II study of the impact of a selective 11βHSD1 inhibitor (AZD4017) on biochemical markers of bone turnover in post-menopausal osteopaenia | Professor P. Stewart, University of Leeds, UK; AstraZeneca | Post-menopausal osteopaenia (diabetes and obesity) | Clinical |
| The role of GABA_B receptor mechanisms in chronic cough (using AZD3355) | Professor J. Smith, University of Manchester, UK; AstraZeneca | Chronic cough (gastroesophageal reflux disease) | Clinical |
| Exploring GABA_A α2,3 signalling as novel therapy for peripheral neuropathies and primary dystonias (using AZD7325) | Professor M. Koltzenburg, University College London, UK; AstraZeneca | Dystonia or neuropathy (anxiety) | Clinical |
| SRC inhibitors (AZD0530) as potential antipsychotics: human testing with psilocybin | Professor D. Nutt, Imperial College London, UK; AstraZeneca | Psychosis (solid tumour) | Clinical |
| A new paradigm for testing pathway tractability in lung disease (using an MMP9 and MMP12 inhibitor (AZD1236)) | Dr N. Hirani, University of Edinburgh, UK; AstraZeneca | Idiopathic pulmonary fibrosis (chronic obstructive pulmonary disease) | Preclinical and clinical |
| The role of MMP inhibitors (AZD1236) in ameliorating muscular dystrophy | Professor D. Wells, Royal Veterinary College, London, UK; AstraZeneca | Muscular dystrophy (chronic obstructive pulmonary disease) | Preclinical |
| Investigating ATP regulation and P2X7 blockade (AZ11657312) in acute renal injury and its long-term complications | Professor R. Unwin, University College London, UK; AstraZeneca | Acute kidney injury (rheumatoid arthritis) | Preclinical |
| Efficacy of saracatinib (AZD0530) in treatment of chronic otitis media in preclinical mouse models | Dr M. Cheeseman, University of Edinburgh, UK; AstraZeneca | Chronic otitis media (solid tumour) | Preclinical |
| Evaluation of AZD1080 (GSK3β inhibitor) in a preclinical mouse model of motor neuron disease | Dr R. Mead, University of Sheffield, UK; AstraZeneca | Amyotrophic lateral sclerosis (Alzheimer disease) | Preclinical |
| Endothelin-1-mediated reduction of cerebral blood flow in Alzheimer disease: therapeutic potential of zibotentan (AZD4054) | Professor S. Love, University of Bristol, UK; AstraZeneca | Alzheimer disease (prostate cancer) | Preclinical |
| GSK3 as a multifunctional target for glioblastoma treatment; hitting multiple tumour hallmarks with a single drug (AZD2858) | Dr S. Short, University of Leeds, UK; AstraZeneca | Glioblastoma (Alzheimer disease) | Preclinical |

NIH-NCATS-funded projects

| Project focus* | Collaborators, institution and industry partner | New indication (original indication) | Type of project |
|---|---|---|---|
| The efficacy and safety of a selective oestrogen receptor-β agonist (LY500307) | Dr A. Breier, Indiana University, Indianapolis, USA; Eli Lilly & Co. | Schizophrenia (benign prostatic hyperplasia) | Clinical |
| FYN inhibition by AZD0530 for Alzheimer disease | Professor S. Strittmatter, Dr H. Nygaard and Professor C. Van Dyck, Yale University, New Haven, Connecticut, USA; AstraZeneca | Alzheimer disease (solid tumour) | Clinical |
| Medication development of a novel therapeutic for smoking cessation | Dr H. Brunzell, Virginia Commonwealth University, Richmond, Virginia, USA; Dr K. Perkins, University of Pittsburgh, Pennsylvania, USA; Janssen Research & Development, LLC | Smoking cessation (psoriasis and rheumatoid arthritis) | Clinical |
Table 1 (cont.) | Government-sponsored collaborative drug repositioning projects

| Project focus* | Collaborators, institution and industry partner | New indication (original indication) | Type of project |
|---|---|---|---|
| A novel compound for alcoholism treatment: a translational strategy | Professor F. Akhlaghi, University of Rhode Island, Kingston, Rhode Island, USA; Dr L. Leggio, National Institute on Alcohol Abuse and Alcoholism and National Institute on Drug Abuse, Bethesda, Maryland, USA; Pfizer | Alcoholism (type 2 diabetes) | Clinical |
| Partnering to treat an orphan disease: Duchenne muscular dystrophy | Dr K. Wagner, Kennedy Krieger Institute, Baltimore, Maryland, USA; Dr S. Froehner, University of Washington, Seattle, USA; Sanofi | Duchenne muscular dystrophy (not reported) | Clinical |
| Reuse of ZD4054 for patients with symptomatic peripheral artery disease | Dr B. Annex, University of Virginia, Charlottesville, USA; AstraZeneca | Peripheral arterial disease (prostate cancer) | Clinical |
| Therapeutic strategy for lymphangioleiomyomatosis (AZD0530 (saracatinib)) | Dr T. Eissa, Baylor College of Medicine, Houston, Texas, USA; AstraZeneca | LAM and TSC (solid tumour) | Clinical |
| Therapeutic strategy to slow progression of calcific aortic valve stenosis | Dr J. Miller, Dr M. Enriquez-Sarano and Dr H. Schaff, Mayo Clinic, Rochester, Minnesota, USA; Sanofi | Calcific aortic valve stenosis (not reported) | Clinical |
| Translational neuroscience optimization of GlyT1 inhibitor | Dr J. Krystal, Yale University, New Haven, Connecticut, USA; Pfizer | Cognitive deficits in schizophrenia (schizophrenia) | Clinical |


*Compound codenames are provided where possible. 11BHSD1, 118-hydroxysteroid dehydrogenase type 1; GABA, y-aminobutyric acid; GlyT1, glycine transporter 1; 
GSK3, glycogen synthase kinase 3; LAM, lymphangioleiomyomatosis; MMP, matrix metalloproteinase; MRC, UK Medical Research Council; NCATS, National Center 
for Advancing Translational Sciences; NIH, National Institutes of Health; TSC, tuberous sclerosis complex. 


CRA to be submitted with the final full 
proposal, which provided a deadline for 
any negotiations, whereas CRAs for the 
MRC programme were negotiated after 
the awards were granted to minimize 
the administrative burden of negotiating 
agreements with companies that might 
not have been awarded funding. The
MRC process was somewhat facilitated
by existing templates, namely the NIHR-MRC
model Industry Collaborative Research
Agreement (mICRA) and the UK
Government Lambert agreements,
which were further adapted for the purposes
of this specific initiative. Despite
the existence of the template agreements,
some negotiation was still required.


Ultimately, nine clinical proposals were
funded by the NIH, as summarized in
TABLE 1 (see REF. 7 for an overview of this
programme).


National Research Program for 
Biopharmaceuticals. In 2013, a similar 
programme was established between 
AstraZeneca and the National Research 
Program for Biopharmaceuticals (NRPB) 
in Taiwan to facilitate translational research 
locally. This programme combines elements
of the NIH-NCATS programme (for
example, posting of a template CRA) and
the MRC programme (both clinical and
preclinical-only proposals supported).
However, for the first time in these relation- 
ships, compounds actively being pursued 
in development (‘live’ compounds) at 
AstraZeneca were included, thereby setting 
a new precedent in this type of setting (see 
Supplementary information S1 (table)). 
One clinical and two preclinical projects 
were funded and are in progress. Additional 
proposals resulting from the networking 
and relationships established under this 
programme are now under collaborative 
discussions between the investigator, 
AstraZeneca and the NRPB. 


Responses to the programmes 
The announcement of the MRC programme 
was met with enthusiasm and seen as an 
exciting, unprecedented opportunity by 
UK investigators. When the NIH-NCATS 
programme was announced, it too was seen 
as an unprecedented opportunity, although 
there was some scepticism and criticism. 
Some criticized the allocation of 
public funds to investigate company com- 
pounds and questioned the return to the 
public sector. However, academic clinical
investigations, including those involving
company compounds, are routinely 
supported by public funds, and these pro- 
grammes provided access to compounds for 
mechanisms that have not previously been 
available. John LaMattina, former President 
of Research and Development at Pfizer, was 
one of many who criticized the involvement 
of academic investigators in drug develop- 
ment’. Such critics argued that the NIH 
(broadly used as a term to apply to academic 
researchers) lacks the experience required 
to develop drugs. Importantly, however, the 
intent of the programmes was to identify 
new uses of existing compounds and known 
mechanisms, not to develop drugs through 
to regulatory approval. Furthermore, pre- 
vious experience of efforts to test company 
compounds in different tumour types, sup- 
ported by the US National Cancer Institute 
(NCI), demonstrated that academic inves- 
tigators are capable of achieving this goal. 
Examples in which NCI involvement in
the early stages of development ultimately
resulted in new medicines include cisplatin 
for the treatment of testicular, ovarian and 
lung cancer, and paclitaxel and fludarabine 
phosphate for the treatment of several 
cancers and lymphoma, respectively. 

In addition, projects supported by the
NCATS programme required company
collaboration to develop the final proposal, 
thus combining the experience and knowledge 
of both academic and industry investigators. 

Another criticism was that company 
scientists have already exhaustively con- 
sidered alternative indications for their 
compounds. This is not always true, as 
companies typically have therapeutic areas 
of focus and only invest time and money in 
those. The indications considered by these 
programmes were unlimited and utilized 
the broad expertise lying outside pharma- 
ceutical companies. More importantly, 
science continuously progresses with new 
discoveries, and some of these discoveries 
could lead to a new use for an existing com- 
pound, for example, unpublished data from 
investigators linking a target to a disease. 
Furthermore, the totality of the available 
data may not be sufficient for investment by 
the company, and additional preclinical or 
clinical data may change interest levels. 

There was also criticism of these pro- 
grammes because the compound structures 
were not initially disclosed®. This criticism 
primarily came from academic groups doing 
in silico drug repositioning, in which struc- 
tural features of one compound are found to 
be similar to those of a drug with a known 
effect that acts through a different target 
(off-target activity). However, the purpose of 
the programmes was hypothesis-based repo- 
sitioning with defined mechanisms of action, 
not to generate new hypotheses for an indi- 
vidual molecule working through off-target 
biology or other repositioning approaches. 

It was concluded that non-hypothesis-based 
investigations were not within the scope of 
these programmes and that the extensive 
compound information provided to investi- 
gators was sufficient to meet the goals of the 
programme, thus compound structures were 
not required. Nevertheless, Southan et al. did 
take the initiative to search publicly available 
databases to identify most, if not all, of the 
compound structures in the NIH-NCATS 
programme, although the accuracy was not 
confirmed by the companies (see REF. 8 and 
the Southan figshare page). 

Caution and scepticism were also evident
within pharmaceutical companies. Many
clinical compounds, although listed as
discontinued after failing to improve on
the current standard of care in their initial
Phase II indication, remain under preclinical
evaluation for alternative indications.
This status typically persists for a few years,
during which time R&D leaders can be 
reluctant to allow the compound to be 
tested externally. By the time this internal
deliberation is exhausted, interest in the
compound and its mechanism of action, both
within and outside the company, and the
patent life have diminished. In addition, as
R&D budgets have decreased over the past
decade, many companies have narrowed their
therapeutic area focus, leading to hesitation 
by some to support investigator-sponsored 
trials in non-core therapy areas. Budget 
constraints can therefore make it challenging 
for companies to supply the clinically formu- 
lated drug (and matched placebo) substance, 
updated regulatory documents (for example, 
the Investigator’s Brochure and Chemical, 
Manufacturing and Controls section) and 
scientific and clinical compound-specific 
advice for disease areas that are not a high 
priority. Recent activities, described below, 
indicate that more and more companies 
have overcome these initial concerns. 


Unique requirements and challenges 
Clinical trials generally fall into one of two 
groups: company-sponsored studies or 
investigator-sponsored studies. To date,
investigator-sponsored studies, regardless
of the funder, almost always use live compounds
that are in development or on the
market. In these circumstances, existing pro- 
ject teams provide the required compound 
insight, regulatory document updates, safety 
database access and so on, and they are 
ultimately the decision makers for whether 
a given study should be run. For a discontin- 
ued compound, given that the project team 
has often been disbanded, there are unique 
challenges that require new ways of working 
to be established. 


Compound selection. The initial MRC and 
NIH programmes were constructed to 
include discontinued compounds as these 
provided the easiest initial path to develop 
and implement such a groundbreaking 
activity. However, the status of a compound 
can be dynamic. As an example, one dis- 
continued compound, a hormone modula- 
tor, was repositioned within the company 
for polycystic ovarian syndrome and a 
Phase II study was initiated while the MRC 
programme was being established. In this 
case, one MRC proposal was similar to the 
internal programme and led to a separate 
collaboration with a leading investigator 
in the area that focused on key scientific 
questions for the programme. It is therefore 
important in such collaborations to be 
flexible and enable a compound to be 
used in more than one collaboration or
to be re-evaluated for development within
the company.

The following general criteria were
established by AstraZeneca and agreed to by 
the MRC and NCATS to select compounds 
for these publicly sponsored collaborative 
drug-repositioning programmes: 

• Clinical and/or preclinical evidence of potency, selectivity and exposure
supporting target coverage was required to ensure conclusive testing of a
novel mechanism-driven hypothesis.
• For clinical proof-of-concept testing, sufficient patient safety to support
further development was required. This involved substantial analysis and
judgment in the context of the indication, length of study or target patient
population. The length of supporting safety studies was highlighted to alert
investigators to the acceptable length of proposed clinical studies.
Investigators could include longer-term toxicology studies in their
proposals if necessary.
• Reasonable cross-species activity and suitability for dosing in animals
were required for all compounds made available for non-clinical studies.
• Patent life was not required. However, sufficient remaining patent life,
new intellectual property or regulatory data exclusivity would probably be
required to advance positive findings further.
• There had to be no other relevant commitments or complex legal agreements
(for example, a compound is not partnered with or licensed to a third party).
• For compounds to be used for clinical studies, enough drug substance to
support a reasonably sized clinical trial had to be available. The same route
of administration as previously used was required to avoid the time, cost and
attrition risk associated with new formulation development, safety studies
and human pharmacokinetic studies. Doses were typically limited to those
supported by the existing data, though in rare cases additional preclinical
data were obtained as part of the programme. The existing clinical data were
directed to new patient populations and requirements (for example, inclusion
of women of child-bearing potential), which was challenged by regulatory
authorities at times.
• Approval for use by the company project team (if appropriate) and the
therapy area head was required.


When selecting compounds, one of the 
largest challenges faced was the collation and 
generation of the relevant datasets, particularly
when project teams had been disbanded.

Table 2 | Key differences between the MRC and NIH-NCATS programmes*

Characteristic | MRC | NIH-NCATS
Purpose | Understanding the mechanisms of disease | Therapeutic development
Scope | Preclinical or clinical or both | Translational projects to hypotheses tested in full Phase IIa trials
Proposal review | AstraZeneca is involved in reviewing the concept proposals | The NIH performs peer-review of concept proposals without company involvement
Industry participants | AstraZeneca only | Eight pharmaceutical companies (AstraZeneca, Pfizer, Eli Lilly & Co., Abbott, Bristol-Myers Squibb, GlaxoSmithKline, Johnson & Johnson and Sanofi)
Time from call for proposals to dosing the first subject in the first study | 25 months (sequential process) | 14 months (more parallel activities)
Collaborative agreement | Negotiated after MRC project approval using a mICRA template | A signed agreement is required before the final proposal submission. Company template CRA agreements posted online
Funding | 3-year award provided up front | Milestone-driven, first year provided up front
Milestone management | Extendable | Preclinical milestone is <12 months after funding is awarded; total time allocated <36 months
Supervision | No required collaboration meetings | NCATS convenes regular investigator–company collaboration meetings

*There are also several similarities between the two initiatives, including the use of template agreements
and of discontinued compounds only. Companies provide the compounds, regulatory documents and
advice, and the MRC or NIH provides funding to the investigators; no funding goes to the pharmaceutical
companies. CRA, collaborative research agreement; mICRA, model Industry Collaborative Research
Agreement; MRC, Medical Research Council; NCATS, National Center for Advancing Translational
Sciences; NIH, National Institutes of Health.


For example, final analysis of histopathology 
data from a rodent carcinogenicity study of a 
discontinued project resulted in a new report- 
able finding for one compound. In another 
example, one discontinued programme was 
found to still be on partial clinical hold under 
the original investigational new drug (IND) 
application, requiring notification to the UK 
Investigator. In many cases, Investigator’s
Brochures had not been updated and
needed to be. One of the key learnings for
AstraZeneca has been that every project 
should be fully closed out, including noting 
incomplete datasets, with a view to potential 
project repositioning or project reactivation. 
Final project document summaries should be 
quickly completed by the original team. 

The AstraZeneca partnership with the 
NRPB in Taiwan and the more recent 
NIH-NCATS ‘Round 2’ programmes allowed 
the inclusion of live compounds. This makes 
the collation of project data much simpler 
as there is a project team in place, although 
the dataset is also more actively evolving as 
ongoing studies read out. In addition, live
compounds bring their own challenges in
terms of balancing confidentiality, focus and
intellectual property rights to new inventions.
Often quoted is the concern that externally
sponsored research (be it clinical or
non-clinical) may generate data with negative
implications for the original programme
(for example, a new safety signal). This is best
addressed by engagement of the project team
and vetting of the risks versus benefits
(risk-benefit assessment) of the additional
indication to the entire portfolio, not just the
single project, to avoid withholding more
novel and interesting live compounds.


Funding and programme management. 
Funding for the studies was provided 
directly from the public funding body to the 
principal investigator. It should be noted that 
the ‘in-kind’ costs incurred by the companies 
are not trivial, particularly if remanufacture 
of the active product, preparation of drug 
product and placebo, completion of study 
reports or regulatory documents, and/or 
patient safety coordination are needed. 


The processes and timelines for the implementation of the projects varied
between the MRC and the NIH-NCATS programmes, which resulted in differences
in timelines to the start of studies, particularly for clinical programmes
(TABLE 2). This can lead to issues such as expiration of the clinical drug
supply. Differences between the programmes included:
• The NIH-NCATS programme required the inclusion of signed collaboration
agreements with the full proposal submission, eliminating the post-funding
delay caused by negotiating such agreements, whereas there was no such
deadline for negotiated agreements for the MRC collaborations. A balance is
needed between the speed of processing applications and reducing unnecessary
paperwork associated with ultimately unsuccessful applications.
• For the MRC projects, delays were encountered between the approval of the
grant and the institution receiving the funds. For example, some
investigators needed to hire personnel for the work and the institution
would not allow recruiting to begin until funds were received.
• The MRC committed the full funding at the beginning of the grant and was
flexible in allowing extensions of the grant timeline in some circumstances,
whereas the NIH-NCATS programme was milestone-driven, with the first year of
funding being provided up front and subsequent funding being granted based
on the study achieving certain milestones.
• The process in the United Kingdom for implementing certain clinical studies
was more sequential (that is, various ethics, regulatory and
study-implementation approvals were obtained in sequence), whereas
investigators in the United States approached certain activities in parallel.
• NIH-NCATS required some supervision and regular meetings to assess the
progress of studies towards milestones.


Nevertheless, the first clinical study 
results for the MRC and AstraZeneca 
NIH-NCATS programmes were obtained in 
the first half of 2014; one example is a mechanistic
study from the MRC programme that
evaluated the effects of a GABA-B receptor agonist,
AZD3355, on capsaicin-induced cough
in healthy volunteers, and another, from the 
NIH-NCATS programme, is a Phase Ib safety 
study of saracatinib in patients with Alzheimer 
disease. Both projects progressed to the 
second phase of their proposals. Most clinical 
study results from the MRC and NIH-NCATS 
projects will be obtained in 2015-2016.


Collaborative working. There tend to be
strict rules governing externally sponsored 
studies to ensure the avoidance of influence 
or bias by the pharmaceutical company over 
the independent investigator. Concerns 
regarding possible conflicts of interest are 
particularly acute for marketed products
or compounds in Phase II development.
These repositioning programmes did not
include such late-stage compounds. Indeed,
collaboration was essential to bring together
the best scientific and clinical expertise
to support these approaches, which were
aimed at testing novel hypotheses. This led 
to the creation of a new AstraZeneca policy 
for collaborative discussions regarding the 
construction of full proposals for these 
investigator-sponsored studies. Beyond the 
scientific, disease and compound properties, 
other areas of discussion included the trial
design, statistical power, patient safety and 
decision criteria. Nevertheless, because these 
remain investigator-sponsored studies, the 
ultimate responsibility and decisions for all 
aspects of the study lie with the investigator. 


Pharmacovigilance. One key element of 
these collaborative studies was to ensure 
the appropriate ownership and management 
of patient safety (pharmacovigilance). 
The responsibility for industry-sponsored 
studies rests with the company, who main- 
tain one global Investigator’s Brochure and 
a global safety database to support and 
ensure harmonization with regulatory 
and ethics safety-reporting requirements. 
For externally sponsored studies, the investigator
takes on the risk-benefit assessment
for the study and the regulatory reporting, 
with additional requirements to report back 
to the parent company regarding safety. 
Regulatory agencies clearly devolve 
responsibility for individual studies to the 
investigator, and this raised the potential 
for lack of centralized coordination for 
discontinued compounds — a concern if 
one compound spawned multiple studies 
in different territories (for example, raising 
the question of how to effectively share an 
observation of a suspected unexpected 
serious adverse reaction (SUSAR) related 
to the compound in a US-based study 
with a UK-based investigator working with 
the same compound in a separate study). 
Therefore, AstraZeneca developed a system
whereby the company retains ownership
of the Investigator’s Brochure and requires
annual updates from investigators to keep
this as one core document. Study-specific
risk-benefit assessments are documented
through a cover note to the Investigator’s
Brochure and/or the protocol and/or the
regulatory filing documents (investigational
medicinal product dossier (IMPD) or IND
documents). The patient safety database is
owned by the company and contracted to a
third party for maintenance, timely reporting
from investigators and dissemination to
all sites working with the same compound.
Furthermore, safety data owned by the
various investigators are made accessible
to other investigators. Thus, AstraZeneca
provides a central position in the cross-study 
evaluation of the safety profile and main- 
tains one harmonized and globally available 
Investigator’s Brochure. 


Intellectual property and publications. 
Frequently asked questions regarding these 
partnerships typically involved intellectual 
property (IP) and publication rights. 

In general, any existing IP remained with the 
owning party and options were established 
for the company to license any new IP gen- 
erated by the investigator. Independent of 
IP, a company may wish to license the data
generated by the investigator to support 
further development. For successful studies 
not further progressed by the compound 
originator, it is in the best interest of the 
company and the academic researchers to 
find a way to advance the programme for the 
benefit of patients. Regarding publications, 
the investigators retained the right to publish 
with standard provisions that provide for the 
company to review before submission. Both 
potential issues were avoided through early 
discussions regarding motivations of each 
partner when designing the programmes. 


Early indicators of success 
Although these novel models have only been 
running for a short time in the context of 
drug development, there are early indicators 
of success and benefits. Highlights include: 
• Greater sharing of valuable, and previously closely guarded, proprietary
information: both the open publication of compound information, providing a
single summary source of previously unavailable information, and the template
legal agreements are unprecedented.
• Crowdsourcing works: there was a diverse range of proposals across
institutions for indications beyond those initially anticipated that were
submitted within compressed timeframes.
• Improved quality of grants: the collaborative discussions during the
construction of full proposals brought together academia and industry to
produce higher quality proposals, as suggested by the assessments given by
the peer review committees and the 100% success rate of obtaining ethics and
regulatory approvals for studies.
• Novel translational research that may not otherwise have been pursued:
companies were not pursuing these studies and these studies would not have
been possible for investigators to conduct without access to the proprietary
compounds and public funds.
• Generation of new intellectual property: to date, at least two patents have
been filed by investigators as a direct result of these activities.
• Two clinical studies have been completed, enabling progression to the next
phase of the MRC- or NIH-funded projects.
• Spin-off proposals: proposals made by investigators but not funded by the
programme still found a way of moving forward. In one case, AstraZeneca
elected to fund a proposal on its own merits, and in another case the
investigator accessed alternative sources of funding.
• There has also been the very important benefit of forging closer engagement
between the academic and commercial sectors, reducing the perceived barriers
and misperceptions, and expanding the knowledge base of compound information.


Saracatinib (also known as AZD0530), 
a deprioritized compound in the 
AstraZeneca portfolio, is an especially 
interesting case study for these repositioning 
programmes. It is a potent, orally bioavail- 
able inhibitor of SRC tyrosine kinase family 
members, including FYN kinase, with an 
expansive preclinical and clinical founda- 
tion of research, including Phase II studies 
in a variety of solid tumours. Ultimately,
saracatinib failed to sufficiently modify dis- 
ease progression in these trials. The general 
and reproductive toxicology of saracatinib 
was studied in rat and dog models for up 
to 6 months. In human studies, the safety 
profile was such that further clinical investi- 
gation was possible. Although the focus had 
been in oncology, it is now recognized that 
the SRC kinase family is involved in biology 
across multiple organ systems. Between 
the MRC and NCATS programmes, five 
preclinical or clinical projects, and one 
additional spin-off project, were funded for 
further investigation across a diverse range of 
non-oncological diseases (BOX 1). Data were 
recently published for one project, showing 
that saracatinib was effective in reducing 
spatial memory defects and synaptic depletion 
in a mouse model of Alzheimer disease’.
These results support the evaluation of
saracatinib in patients with Alzheimer 
disease, and clinical studies are taking 
place as part of this programme”. 

Perhaps the best indicator of early success
is the expansion of these programmes.
As previously mentioned, AstraZeneca and
the NRPB in Taiwan partnered to initiate
a similar programme, and both the MRC and
the NIH have initiated a second round
of calls for proposals and funding. The
MRC programme has expanded beyond
AstraZeneca to include 68 compounds from
7 pharmaceutical companies (AstraZeneca,
GlaxoSmithKline, Johnson & Johnson,
Eli Lilly & Co., Pfizer, Takeda and UCB).
The NIH programme has expanded to
include live development compounds and
support for paediatric proposals.

Ultimately, the true measure of success 
will be either positive clinical data that 
define a new mechanism involved in impact- 
ing human disease or definitive negative data 
that help to disprove a hypothesis and result 
in a redirection of attention and resources 
to more promising avenues. 


Outlook 

These pioneering government-sponsored 
drug repositioning initiatives have success- 
fully piloted a new era of open collaboration 
and innovation between academia and the 
pharmaceutical industry on translational 
research. However, these initiatives have 
only begun to scratch the surface of the 
potential opportunities to strengthen and 
expand such efforts. 

The number of compounds involved 
could easily increase by the participation of 
additional companies and incorporation 
of more live compounds, particularly those 
in early development, as is often done in the 
field of oncology. Additionally, considera- 
tion of off-target effects could be included. 
Disease-centric charity organizations that 
have focused their strategies on the transla- 
tion of research into therapies for patients 
could partner to fund programmes of inter- 
est on compounds that have been made 
publicly available. Some pioneering 
charities have embraced repositioning as 
a strategy, including the Michael J. Fox 
Foundation, which funded their first 
repositioning-focused programme in 2010, 
the Leukaemia and Lymphoma Society and 
Cancer Research UK (CRUK), although the 
opportunity remains to specifically leverage 
the compounds posted publicly. 

Start-up companies could contribute 
compounds to expand their efforts without 
financial dilution. Once a company has 


Box 1| The evolving science of saracatinib 


programmes (see the figure). 


in cancer patients with bone metastases. 


psychosis induced by infusion of psilocybin. 


larger population with mild Alzheimer disease. 


Saracatinib as a novel analgesic 
for cancer-induced bone pain 


D. Andrew, University of Sheffield, UK 


Efficacy of saracatinib in 
treatment of chronic otitis media 


M. Cheeseman, University of 
Edinburgh, UK 


SRC inhibitors as potential antipsychotics: 
human testing with psilocybin 


D. Nutt, Imperial College London, UK 


participated in one partnership the effort 
needed for additional partnerships is much 
lower. Venture capitalists could establish new 
start-up or spin-out companies based on 
either the compounds made available or the 
funded proposals. In fact, one could imagine 
a ‘marketplace’ of compounds and funders 
for investigators to access. As a result of 

the success of current pilot programmes, 
AstraZeneca has recently launched a broader 


in preclinical mouse models << Sare 


Saracatinib (also known as AZD0530) is an especially interesting case study for these repositioning 
programmes. Saracatinib inhibits the SRC kinase family, and although it was developed for 
oncology indications, it is now recognized that SRC kinases could be important in diseases related 
to multiple organ systems. Saracatinib is being investigated in six non-oncological indications 

as a result of the UK Medical Research Council (MRC) and US National Institutes of Health (NIH) 


SRC kinase as a novel analgesic for cancer-induced bone pain 

The hypothesis that SRC kinase is a crucial component of cancer-induced bone pain will be tested 
through preclinical studies in an animal model of bone cancer pain, investigating the effects of 
inhibiting SRC on pain-related behaviour, spinal cord neuron phosphorylation and signalling, 
as well as bone resorption, to identify potential analgesic mechanisms of SRC inhibition. 

A randomized controlled trial of saracatinib will investigate whether it has analgesic effects 


SRC inhibitors as potential antipsychotics: human testing with psilocybin 

Existing data demonstrate that the SRC kinase pathway is directly involved in the observable 
symptoms associated with the acute administration of hallucinogens that modulate the 5-HT,, 
receptor, such as psilocybin. This clinical study is evaluating the role of SRC kinase in blocking 


Efficacy of saracatinib in treatment of chronic otitis media 
This preclinical study will determine whether local SRC kinase inhibition moderates vascular leak 
and bulla fluid accumulation leading to reduced hearing loss in models of chronic otitis media. 


Therapeutic strategy for lymphangioleiomyomatosis and tuberous sclerosis 
Lymphangioleiomyomatosis (LAM) is a rare progressive cystic lung disease. This research team 
discovered that SRC kinase is active in LAM cells and is important for cell growth and a cell’s ability 
to move around and invade tissues. This preclinical study aims to determine whether blocking SRC 
activity is safe and can reduce the growth and the spread of LAM cells. 


FYN inhibition by saracatinib for Alzheimer disease 

FYN, a SRC kinase family member, is implicated in triggering Alzheimer disease. This study seeks 
to test the hypothesis that FYN has an important role in Alzheimer disease and that saracatinib 
provides benefit to patients with Alzheimer disease in a Phase lla clinical study. This effort has 
shown that saracatinib has beneficial effects in a mouse model of Alzheimer disease’, and a 
Phase Ib study of the safety and tolerability of saracatinib in patients with Alzheimer disease has 
been completed”. The drug is to be tested for effectiveness in slowing disease progression in a 


Therapeutic strategy for lymphangioleiomyomatosis (saracatinib): T. Essai, Baylor College of Medicine, Houston, Texas, USA

FYN inhibition by saracatinib for Alzheimer disease: S. Strittmatter et al., Yale University, New Haven, Connecticut, USA

Multiple additional studies


worldwide ‘open innovation’ initiative that
offers a range of compounds for which 
academic investigators can submit new 
repositioning ideas and translational 
research (see Further information). This 
broad open innovation platform is the first 
to offer compounds for collaboration and 
provide a template for a true marketplace 
that invites ideas and proposals from any 
contributor and any sector. 
A more radical model would use
crowdsourcing during the creation of 
the proposals. At present, proposals are 
submitted by individuals. Alternatively, 
an initial proposal, submitted in an open 
manner, could then be crowdsourced for 
improvements, including alternative patient 
populations, endpoints, or trial designs. 
‘Gamification’, the use of game thinking
and mechanics to engage users in solving
problems, is one potential crowdsourcing
route wherein funders, investigators, com- 
panies and reviewers each play a different 
part. Another possibility is the platform 
developed by Transparency Life Sciences to 
develop drugs through collaborative input. 
Any crowdsourcing model would have to 
resolve intellectual property rights, deter- 
mining who performs the research and how 
credit for the final proposal is assigned. 

Barriers remain that could prevent this 
new partnership model from reaching its 
full potential. Creating broad awareness 
of the opportunity among investigators 
can be challenging. The availability of new 
compounds should increase over time; 
however, the available clinical supply of 
discontinued compounds quickly becomes 
exhausted. Resistance to the inclusion of live 
compounds remains prevalent within some 
companies and funders (although the most 
recent NIH-NCATS programme did support 
the inclusion), and the inclusion of biologics 
is challenged by the complexities of com- 
pound supply and delivery. Patent lives are 
continuously diminishing and the pursuit of 
more expensive registration studies following 
positive early studies may be deterred by the 
lack of financial return. Although regulatory 
data exclusivity provisions may provide an 
alternative to patent life, the limited term 
available in the United States is likely to be 
insufficient in many cases. 

The traditional barriers to open innova- 
tion have only just begun to be addressed. 
Supplemental resourcing, especially funding, 
is still needed to justify the pursuit of certain 
areas, including those involving rare and 
niche indications or those compounds with 
a limited remaining patent life. If projects 
are successful, the appropriate distribution 
of royalties must recognize the relative con- 
tributions of each party. Even greater trust
in sharing between parties can be achieved,
including sharing scientific information, 
patient safety monitoring and scientific 
credit for innovative advances. 
Nevertheless, the achievements of these 
pioneering initiatives include important 
lessons learned for all parties involved and 
advances that tackle many of the key barriers. 
The question confronting all parties now is: 
will we move forward against any remaining 
hurdles and advance this model or will we 
revert to the historical, more ‘closed’ translational
research and collaboration models?
Towards a hit for every target


Steve Rees, Philip Gribbon, Karen Birmingham, William P. Janzen and Garry Pairaudeau


Technological advances coupled with novel collaborative strategies for compound sourcing
and management are poised to transform the utility of high-throughput screening.


The advent of high-throughput screening (HTS) in
the 1990s revolutionized the search for lead molecules.
It enabled screening of large, diverse compound libraries
against drug targets to identify chemical probes for target
validation and drug candidates for clinical development
on a scale not previously envisaged. However, the suc-
cess of HTS was limited by the quality and diversity of
the screening libraries and by the nature of many of the
assay technologies, which required compound testing in
artificial biological systems’.

Twenty-five years on, we believe we are on the cusp
of a new era in HTS, in which its transformation from
a ‘numbers game’ to a smart selection tool will be com-
pleted. This transformation is being driven by recent
advances in computational and medicinal chemistry, cell
biology, biophysics and instrumentation, which together
could enable the design of bespoke screening libraries for
each drug target that can be tested in information-rich
biophysical assays and advanced cellular models. The
innovations now being implemented could deliver on the
long-sought promise of a hit for every target, meeting
the strategic need to look beyond traditional drug target
families for the next generation of medicines. Furthermore,
a range of innovative partnerships mean that in this new
era, lead discovery based on HTS will no longer be the
preserve of pharmaceutical companies, but will become
commonplace in the academic research community.


Enhanced screening technology

For much of the past 20 years, pharmaceutical compa-
nies have used a target-class approach to drug discovery,
focused on screening members of tractable protein
superfamilies such as G protein-coupled receptors
and kinases. A major strategy was to identify chemical
probes to be used for associating the targets with disease.
There is now a drive towards the selection of drug
targets with increased levels of target validation before
a screening programme is started’. This has led to an
increase in the diversity of target types and biological
pathways to be screened, and includes a renewed interest
in phenotypic screening.

However, advances in assay biology and screening
technology have been slow. Testing compounds against
recombinant proteins or cell lines in fluorescent, lumin-
escent or radiometric assays in microtitre plate format
has been the workhorse of the field. We believe this
situation will evolve. Stem-cell and primary-cell tech- 
nology, together with 3D and microphysiological cul- 
ture systems, will increasingly enable screening in more 
disease-relevant assays. The use of ‘omics’ platforms 
(transcriptomic, proteomic and metabolomic), in which 
multiparametric data from complex assay systems are 
collected, will become commonplace, and advanced data 
capture and analytics tools will be used to interpret the 
large volumes of data. Finally, new technologies, includ- 
ing acoustic delivery of samples for mass spectrometry, 
will enable novel screening approaches. 


Improved compound library quality 

A typical pharmaceutical company compound collec- 
tion consists of ~1-3 million molecules selected for their 
drug-like or lead-like properties’. These collections rep- 
resent the chemical history of companies based on their 
historical portfolio of internal projects, directed efforts 
to target libraries to specific areas of biological relevance, 
and synthesis or acquisition based on chemical diversity 
and synthetic tractability. 

Success in screening against novel target families 
requires that we expand into new chemical space and 
new therapeutic modalities, yet the cost of acquiring or 
making large numbers of new compounds is prohibitive 
even for large pharmaceutical companies. Moreover, 
practical considerations such as storage volume, store 
size and screening costs limit the number of compounds 
in a collection. We believe that these constraints can be 
addressed through a series of imminent innovations. 

First, knowledge-based selection of compounds for 
screening is limited to target classes for which knowl- 
edge (typically structure-derived) of desirable pharmaco- 
phores is available, such as kinases, and the size of 
such sets is limited by the capacity of available com- 
pound-picking robots. In the coming years, technologies 
such as cryo-electron microscopy and advances in the 
ability to crystallize membrane proteins should rapidly 
expand knowledge of the structure of putative drug bind- 
ing sites. Combined with improved computational tools 
that enable more effective interrogation of chemical space 
and the development of capability in compound manage- 
ment to create bespoke screening libraries, it will become 
possible to create knowledge-based screening libraries 
for many targets. Delivery of this vision will require the
development of new high-throughput synthetic methods 
to create compounds for screening. 

Second, the ability to dispense nanolitre volumes of 
a compound solution directly from low-volume storage 
tubes to assay plates using acoustic technology will 
revolutionize compound library design, storage and 
distribution through miniaturization. It will enable the 
screening of compounds generated by expensive or low- 
yield syntheses and will reduce storage costs. The ability 
to rapidly pick compounds will support the creation of 
target-specific compound libraries and iterative screen- 
ing paradigms. This could have a particular impact in 
the academic setting, as publicly sourced compounds are 
typically available in small amounts. 

Third, compound sharing creates the opportunity 
to rapidly increase library diversity. For example, 
AstraZeneca has recently made agreements with Sanofi 
and Syngenta to exchange 240,000 novel compounds for 
screening at no cost. Many examples of sharing exist with 
and between academic centres, including Sanofi and 
Evotec, Sanofi and Fraunhofer’s Center of Excellence for 
Natural Products Research, and the University of North 
Carolina's agreements with Southern Research Institute, 
the National Center for Advancing Translational 
Sciences and the LIMR Chemical Genomics Center. 
Another model of compound sharing is the collabor- 
ation between AstraZeneca and Bayer, whereby each 
company makes available its entire compound library 
for screening of the partner's target. 

Finally, an increased interest in screening biological
molecules, including peptides and secretome-, siRNA-
and CRISPR-based libraries, will require the develop-
ment of new methods for sample storage and supply to
complement small-molecule screening. 

Only the largest organizations could create these 
capabilities in-house. The smartest will create networks 
of shared capabilities to match the right drug target to the 
right compounds. 


Succeeding through partnership

There has been a recent and dramatic increase in 
the number of academic drug discovery centres; the 
Academic Drug Discovery Consortium currently 
includes 136 centres. This increase has been driven by 
growing interest in identifying small-molecule probes to 
explore biology and in therapeutic discovery, as well as 
a strategic intent from funders in translational science. 

In parallel with the changes in academia, there has 
been a shift in the mindset of some companies to embrace 
‘open innovation’, which recognizes that no organization
can succeed alone and partnerships based on complemen- 
tary investment and shared success are needed to deliver 
novel medicines. In hit discovery, this requires companies 
to make their libraries and screening infrastructure avail- 
able to partners, and for academics to share target and dis- 
ease knowledge and expertise across multiple institutions, 
including companies. 

Some examples of this paradigm already exist. For 
example, the creation of the European Lead Factory 
(ELF) involved the donation of compounds from seven
companies to create the 270,000-compound Joint
European Compound Library (JECL), which is being sup- 
plemented by 230,000 novel academic compounds. The 
EU-OPENSCREEN project aspires to create a network of 
specialist laboratories sharing a common infrastructure 
and compound library, where users will be mandated to 
make project bioactivity results available to the broader 
scientific community. 

Several companies are making all or part of their 
compound library available to academia through open 
innovation collaborations. For example, AstraZeneca’s 
MRC Centre for Lead Discovery will see scientists 
from the Medical Research Council (MRC), Cancer 
Research UK (CRUK) and AstraZeneca working side by 
side to advance industrial and academic targets using 
next-generation compound management and screening 
platforms. AstraZeneca will make its hardware, compound 
library and scientific expertise available to academia; the 
MRC and CRUK will identify innovative drug targets 
from their principal investigator networks. 


Training and skills 

The skills and knowledge in compound management and 
HTS that were traditionally the preserve of large pharma- 
ceutical companies and contract research organizations 
are increasingly needed in smaller companies and aca- 
demia. Until now, such expertise has largely come from 
the release of experienced staff from industry who have 
moved into academia. However, the explosion in the 
number of academic drug discovery groups is creating a 
skills shortage at the same time as generating a solu- 
tion. Many young scientists are completing their formal 
training with experience in these areas gained through 
collaborations with academic drug discovery centres. 
The remaining gap is the formalization of these training 
programmes, which must be addressed by funding bodies. 
Increasingly, organizations such as the Society for 
Laboratory Automation and Screening are facilitating 
professional programmes to train and develop the coming 
generation of hit discovery scientists. 


Conclusion 

Attaining the holy grail of a hit for every target depends 
on new ways of working and on technological and scien- 
tific advances. Industry and academia must invest in their 
respective strengths of lead and target identification and 
both must open their doors and their expertise to partner- 
ships based on complementary investment and shared 
success. Funding bodies should foster such collaboration 
and companies should create innovative win-win deal 
structures to encourage such partnerships. 
The global intellectual property landscape of induced
pluripotent stem cell technologies


MacKenna Roberts, Ivan B Wall, Ian Bingham, Dominic Icely, Brock Reeve, Kim Bure, Anna French &
David A Brindley


Will freedom to research and innovate be restricted as the induced pluripotent stem cell field advances toward the 
clinic, or are concerns premature within a rapidly changing ecosystem? 


Intellectual property (IP) rights lie at the core
of the commercialization process, serving as
a powerful incentive to harness the potential 
of technologies for therapeutic applications. 
However, when filed inappropriately—with 
broad or premature claims, for example—or 
when mismanaged, patents can obstruct vital 
precompetitive collaborations, dampen inves- 
tor interest and threaten clinical translation 
and patient access (refs 1-3). Moreover, uncertainties,
including those around the validity of claims 
or the priority of ownership rights, can depress 
progress’. 

1Oxford-University College London Centre
for the Advancement of Sustainable Medical
Innovation, University of Oxford, Oxford, UK.
2Department of Biochemical Engineering,
University College London, London, UK.
3Department of Nanobiomedical Science and
BK21 Plus NBM, Global Research Center of
Regenerative Medicine, Dankook University,
Cheonan, Republic of Korea. 4Biomaterials
and Tissue Engineering Lab, Department
of Nanobiomedical Science and World
Class University Research Center, Dankook
University, Cheonan, Republic of Korea. 5IP
Asset LLP, Oxford, UK. 6The Harvard Stem
Cell Institute, Cambridge, Massachusetts,
USA. 7Sartorius Stedim, Göttingen, Germany.
8Nuffield Department of Orthopaedics,
Rheumatology and Musculoskeletal Sciences,
Nuffield Orthopaedic Centre, University of
Oxford, Oxford, UK. 9Centre for Behavioural
Medicine, UCL School of Pharmacy, University
College London, London, UK. 10These authors
contributed equally to this work.
e-mail: anna.french@casmi.org.uk or
david.brindley@ndorms.ox.ac.uk

Stem cell derivation and cellular
reprogramming techniques are broadly
enabling technologies for which ready access
is essential to a sustainable cell therapy 
industry. Pluripotent stem cells (PSCs) have 
the potential to transform future healthcare. 
Clinical trials using embryonic stem cells 
(ESCs) have begun to show early promise”; 
however, concerns remain regarding limited 
supply and ethics. In 2007, scientists at Kyoto 
University and, independently, at the University 
of Wisconsin, reprogrammed adult human skin 
cells to generate induced pluripotent stem cells 
(iPSCs)®7. In addition to long-term therapeutic 
potential, iPSC technologies are valuable tools 
for drug discovery, toxicology testing and 
disease modeling. 

Although the total number of stem cell 
patent filings has declined since 2008, patents 
for iPSC technologies continue to increase’. 
Analysis of the blistering pace of scientific 
progress in this area indicates that the growth 
is unlikely to abate. Widespread concern has 
been voiced that the emerging ecosystem 
is becoming burdened by prohibitive and 
cumulative licensing fees that could restrict 
scientists’ freedom to research and patients’ 
equitable access to resulting medical 
benefits*-*®?. Although these concerns may 
be allayed through innovation and industry 
growth cycles, the upstream production and 
downstream differentiation of iPSCs into 
desired cell lineages for application requires 
numerous interrelated, complex technologies. 
This distinguishes cellular patents from the 
evolution of other highly patented industries 
such as small-molecule drugs and electronics. 
No single company currently controls the IP for 
all techniques, methods and reagents required 
for the production of iPSCs (ref. 10). A global race
is underway to establish the most suitable and 
efficient methods for each of these component 
technologies. 


Although it may be beneficial for competing 
industry that no single entity dominates 
market access to all fundamental iPSC 
technologies, congestion of patents related 
to broadly enabling technologies, referred 
to as a ‘patent thicket’, has the potential
to bottleneck innovation. If ownership of 
overlapping fundamental aspects is spread 
across several patent holders, research may 
become encumbered by the negotiation 
of complicated stacked licensing with 
prohibitive fees and royalty schemes that 
drain value from innovation. This is referred 
to in IP rights theory as ‘anti-commons’ 
innovation (ref. 11). Funders may, consequently, be
deterred by vulnerability to future challenges 
against IP ownership of a product after costly 
development. 

Analysts estimate that total global revenues 
for companies that supply iPSC research 
products were $837 million in 2012 and that
this will exceed $1 billion by 2015 (ref. 12). 
Additionally, 22% of stem cell researchers 
report having used iPSCs!*. With first-in- 
man clinical research studies underway this 
year at RIKEN‘, now is a critical time in iPSC 
technology R&D to address the validity of 
any potential IP obstacles to the commercial 
translation of medical advancements. 

Despite extensive literature, a current, 
global analysis that focuses exclusively 
on the burgeoning IP landscape for iPSC 
technologies has not been performed. 
To provide clarity, we consider the broad 
landscape and discuss the validity of 
mounting concerns over licensing practices 
for this nascent ecosystem, possible solutions 
and the emergence of other forms of IP that 
could erect barriers to open and collaborative 
science. Accordingly, we have employed 
commercial software to analyze patent 
data, which identified a fragmented patent
landscape with internationally dispersed
ownership around complex, interrelated
technologies.


Table 1 iPSC main technology categories

Category           Estimated no. of patent families (patented inventions)
Production         650
Differentiation    1,300-1,400


Technology areas and trends 

To identify the major trends, a raw data set 
was harvested to include a broad range of 
technologies related to iPSCs, comprising 
4,651 documents with 1,388 Derwent World 
Patents Index (DWPI) families (patented 
inventions) using Thomson Innovation 
software. This initial collection was
characterized and cleaned, both manually
and using Thomson Data Analyzer software
(see Supplementary Note). Patents filed
and/or granted between 1 September 2006
and 31 December 2013 were searched using
keywords in patent titles, abstracts and claim
sections across a complete global data set of
50 patent authorities.


The challenge in isolating the iPSC patent 




Differentiation technologies are the 
largest group of iPSC patents. In this broad 
category of patents, referred to as the 
‘differentiation patents’, iPSCs have a range of
roles, from central to more ancillary. In the 
latter, pluripotent cells are cited merely as a 
possible starting material for the invention, 
among other listed cell types. Including 
differentiation patents within the iPSC 
landscape initially gives a broad overview with 
considerable overlap with the wider PSC (iPSC 
and ESC) landscape. A conservative estimate 
of the number of differentiation patents is 
1,300-1,400 patent families (inventions) with 
4,500-5,000 documents overall (Table 1 and 
Supplementary Table 3). This figure would 
probably be higher if more differentiation 
patents for ancillary technologies that cite 
iPSCs in their claims were included. 

The category of patents related to iPSC 
production, an estimated 650 filings, includes 
culture techniques, growth factors and 


PATENTS 


technologies essential to the generation of 
iPSCs (Table 1 and Supplementary Table 3). 
Estimates do not account for discontinued 
or denied applications. Excluding ancillary 
technologies such as culture techniques, we 
identified 299 patents for reprogramming 
methods. 

A digital representation of the broad 
iPSC patent landscape provides a 
macroscopic glimpse of patenting trends 
and can help visualize technology clusters 
(Fig. 1). Automated software clusters similar 
terminology found in the patents’ titles, 
claims and abstracts. Patents are represented 
by dots, and those with similar terminology 
are delineated by contour lines. The greater 
the similarity in the patents’ terminology, the 
greater the elevation of the cluster contour. 
The white ‘snow-capped’ peaks represent areas 
of high patent activity for the labelled terms: 
multiple sclerosis, spinal injury and diabetes 
research; cancer research; reprogramming 
methodologies; gene therapy and tissue 
engineering; and cell culturing mediums 
and techniques, including assays to confirm 
pluripotency and cell population selection 
techniques (Fig. 1). 
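
The idea of grouping patents by shared terminology can be illustrated with a minimal sketch. It is not the algorithm used by the commercial software behind Fig. 1, which is not described here; it assumes a simple bag-of-words representation, cosine similarity and a hypothetical similarity threshold of 0.5, and the four short texts are invented stand-ins for patent titles.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Bag-of-words counts over lower-cased alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def cluster(texts, threshold=0.5):
    """Single-link grouping: any two texts whose similarity reaches
    the (assumed) threshold end up in the same cluster."""
    vectors = [term_vector(t) for t in texts]
    parent = list(range(len(texts)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(texts)):
        for j in range(i + 1, len(texts)):
            if cosine(vectors[i], vectors[j]) >= threshold:
                parent[find(j)] = find(i)

    groups = {}
    for i in range(len(texts)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Invented example titles, not taken from the patent data set.
texts = [
    "reprogramming somatic cells with reprogramming factors",
    "method for reprogramming somatic cells using factors",
    "culture medium for cell culture",
    "serum free culture medium for cell culture",
]
clusters = cluster(texts)
print(clusters)  # indices of texts grouped by shared vocabulary
```

In this toy case the two reprogramming titles and the two culture-medium titles fall into separate groups. A landscape map additionally projects the documents into two dimensions and draws contours by density, which this sketch does not attempt.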

Although a number of the peaks involve 
cell-reprogramming techniques distributed 
among a range of technology areas 
landscape from the broader pluripotent
landscape (Supplementary Fig. 1) is in 
the high degree of overlap in terminology, 
techniques and applications that relate to 
both ESCs and iPSCs. Another particular 
challenge is the significant overlap between 
cellular processes and materials claimed for 
reprogramming and differentiation. 

To narrow the search within pluripotency, 
we selected more rigorous keyword searches 
(Supplementary Table 1) to yield data sets 
that were further manually refined through 
random sampling of more than 700 patents 
(Supplementary Table 2). The data set 
therefore represents an over-inclusive
estimation rather than a precise figure. 
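
As a rough illustration of the search-and-refine procedure, the sketch below filters records by keyword and filing date, counts documents and distinct families, and draws a random sample for manual review. It is an assumption-laden example only: the record fields (`doc_id`, `family_id`, `filed`, `title`, `abstract`, `claims`), the single keyword and the four toy records are hypothetical, and the real keyword sets are those in Supplementary Table 1, which are not reproduced here. The date window is the one quoted in this article.

```python
import random
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class PatentDoc:
    """Hypothetical record layout; real patent exports will differ."""
    doc_id: str
    family_id: str  # documents covering the same invention share a family
    filed: date
    title: str
    abstract: str
    claims: str


# Search window quoted in the article.
WINDOW_START = date(2006, 9, 1)
WINDOW_END = date(2013, 12, 31)


def matches(doc, keywords):
    """True if the document falls inside the window and any keyword
    occurs in its title, abstract or claims (case-insensitive)."""
    if not (WINDOW_START <= doc.filed <= WINDOW_END):
        return False
    text = " ".join((doc.title, doc.abstract, doc.claims)).lower()
    return any(k.lower() in text for k in keywords)


def search(docs, keywords):
    """Return matching documents and the set of distinct families."""
    hits = [d for d in docs if matches(d, keywords)]
    return hits, {d.family_id for d in hits}


def sample_for_review(hits, n, seed=0):
    """Random sample of hits for manual checking (seeded for repeatability)."""
    rng = random.Random(seed)
    return rng.sample(hits, min(n, len(hits)))


# Invented toy records.
docs = [
    PatentDoc("A1", "F1", date(2008, 3, 1),
              "Induced pluripotent stem cell production", "", ""),
    PatentDoc("A2", "F1", date(2009, 6, 1),
              "Cell production method", "Uses induced pluripotent cells", ""),
    PatentDoc("B1", "F2", date(2005, 5, 1),
              "Induced pluripotent cells", "", ""),  # filed before the window
    PatentDoc("C1", "F3", date(2010, 1, 1),
              "Battery electrode", "", ""),          # unrelated subject
]
hits, families = search(docs, ["induced pluripotent"])
review = sample_for_review(hits, 5)
print(len(hits), "documents in", len(families), "families")
```

Counting families separately from documents mirrors the distinction drawn in the text between patent documents and patented inventions: several documents can belong to one family.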

The iPSC R&D pipeline is divided between 
upstream production and downstream 
differentiation techniques. Production refers 
to the reprogramming of somatic cells to a 
pluripotent state. Once reprogrammed, these 
cells are selected and replicated. Candidate 
pluripotent cell colonies are then characterized 
for confirmation of a pluripotent phenotype 
before further propagation. The resulting 
iPSCs can serve as important research tools 
for a range of downstream applications that 
differentiate iPSCs into specialized cells. The 
differentiated cells created from PSCs can then 
be used for drug screening, toxicity testing, 
disease modeling or therapeutic application. 


Figure 1 Broad iPSC technology patent landscape. The five white ‘snow-capped’ peaks indicate 
technology areas of high patent activity: cell culture (including cell selection and characterization 
techniques), spinal injury, cancer (involving cancer cells, inhibiting teratoma formation or research 
intended as a therapeutic indication toward cancer), reprogramming methodologies and patent 
documents that cite gene therapy or tissue engineering purposes. Each dot represents a patent.
The stronger the correlation of automatically clustered terminology within a contour, the higher the
elevation of related patents. The patent map, generated using Thomson Innovation software, provides 
a representative overview of the original raw patent collection before it was cleaned to remove patents 
that do not directly claim iPSCs as a central aspect of the patented invention. This initial raw data set 
of 1,388 patent families with 4,651 total documents comprises both granted and pending applications 
from 1 September 2006 to 31 December 2013. (Source: Thomson Reuters) 
Figure 2 Trends in iPSC patent document filings. (a) The number of total patented inventions filed for
iPSC technologies has increased each year since 2006, peaking in 2012. (b) National contributions to 
iPSC invention by earliest priority country. The United States dominates the fundamental technology 
platforms in the field, followed by Japan, as could be predicted on the basis of technology origin and 
subsequent national focus. China and the Republic of Korea significantly contribute in addition to the
European Patent Office and the United Kingdom.


(Supplementary Fig. 2), the map flushes 
out a visual of the third-largest category of 
iPSC-related patents (generalized from the 
size of the snow-capped peaks in Fig. 1): cell- 
culturing techniques and growth factors. This 
category includes technologies for both the 
generation and differentiation processes. 
Each of these broad categories represents a 
technology umbrella for a series of complex, 
multistep processes. Therefore, each step in 
the iPSC ‘pipeline’ further subdivides into 
specific areas of research and IP proliferation. 
Because accepted best-practice standards 
within the industry for each of these steps 
do not yet exist, they represent current
and potentially profitable technology gaps
but, equally, concerns for patent thicket 
formation. 


Patent filing profile 

Published patent applications filed on 
iPSC-related technologies nearly doubled 
from 2009 to 2010 (Fig. 2). In 2012, Shinya 
Yamanaka shared the Nobel Prize in 
Physiology or Medicine with John Gurdon 
for their contributions to reprogramming. In 
the same year, published patent applications 
relating to iPSCs reached an all-time high 
(Fig. 2). Despite a modest dip in 2013, a 
similar number (150) of new filings on
discoveries broadly relevant to iPSCs can be
expected in 2014. This trend is unlikely to 
decrease, provided a nonobvious, novel step 
change in reprogramming standards is not 
discovered. 

Patent activity was analyzed by 
priority country to indicate geographical 
concentration of iPSC research. The United 
States and Japan clearly lead, with the greatest 
contributions to iPSC patented inventions, 
but research is active on a global stage 
(Fig. 2). In 2013, Japan passed legislation 
to expedite approval of candidate stem cell 
therapeutics without phase 3 clinical trials)’. 
If this regulation results, as anticipated, in 
the acceleration of therapeutic products to 
market, the corresponding patents may also 
be expedited, as was the grant of Yamanaka’s 
patent in 2008. However, this remains to 
be seen. Meanwhile, the United States has 
steered in a different direction. The federal 
appeals court ruled in February 2014 that 
more-than-minimally manipulated stem cell 
therapies must be regulated as a (biological) 
drug in accordance with US Food and Drug 
Administration regulations, rather than more 
leniently as a medical practice or medical 
device (cellular product)'*. 

Still, the United States, with its significant 
margin for pending and issued patents, 
is unlikely to be surpassed by Japan. 
Additionally, the United States has a more 
diverse base of assignee expertise compared 
to Japan’s concentration in a handful of 
institutions (Table 2). However, in an 


Table 2 Leading 10 assignees and inventors (924 patent family cleaned subset of collection)

Rank   Organization   Top inventors
1   Kyoto University (Kyoto, Japan)   Yamanaka, S., Takahashi, K., Nakagawa, M.
2   University of California, Santa Cruz (Santa Cruz, California)   NA
2   General Hospital Corp. (Boston)   Hochedlinger, K., Maherali, N.
2   Harvard College (Boston)   Maherali, N., Hochedlinger, K.
4   National Institute of Advanced Industrial Science and Technology (Kyoto, Japan)   Yamanaka, S., Goshima, N., Mochizuki, H., Kawamura, Y., Maekawa, M.
4   Whitehead Institute of Biomedical Research (Cambridge, Massachusetts)   Jaenisch, R., Hochedlinger, K., Young, R.A.
4   Wisconsin Alumni Research Foundation (Madison, Wisconsin)   Thomson, J.A., Chen, G., Yu, J.
5   Cellular Dynamics International (Madison, Wisconsin)   Mack, A., Yu, J., Thomson, J.A.
5   Guangzhou Institutes of Biomedicine and Health (Guangzhou, China)   Pei, D.Q., Esteban, M., Quin, D.J., Chen, J.K., Yang, J.Y.
6   Agency for Science, Technology & Research (Singapore)   Ng, H.H.
6   Salk Institute for Biological Studies (La Jolla, California)   Belmonte, J.C.I., Gage, F.H., Raya, A.
6   Stanford University (Stanford, California)   NA
7   Advanced Cell Technology (Marlborough, Massachusetts)   Lanza, R., West, M.C., Lu, S.J.
7   Sangamo Biosciences (Richmond, California)   Holmes, M.C., Gregory, P.D., Urnov, F.
7   University of Tokyo (Tokyo)   Nakauchi, H., Takayama, N., Eto, K.
8   Johns Hopkins University (Baltimore, Maryland)   NA
9   Keio University (Tokyo)   Okano, H., Fukuda, K., Hattori, F.
10   Children’s Medical Center (Boston)   Daley, G.Q., Agarwal, S., Park, I.H.
10   Primegen Biotech (Irvine, California)   Silva, F.J., Sayre, C.B.


Only inventors with three or more patents are named. NA, not applicable. 
Figure 3 The distribution of ownership origination shows a fragmented landscape with public-sector
dominance and a growing cottage biotech industry. (a) Breakdown of sector participation in iPSC R&D 
shows a significant number of iPSC patents in government, nonprofit and academic institutions (58%). 
The public sector dominance is unsurprising for emerging technologies. However, 34% corporate 
ownership indicates that this field has started to actively privatize at an early stage. (b) Distribution of 
patented inventions among patent holder organizations. The domination of lower-value patent portfolio 
holdings shows a large number of smaller players in the sector, indicating that the 34% corporate face 
of iPSC patent ownership is composed largely of small biotechnology companies. The thin distribution 
of patent ownership reveals a highly fragmented landscape. (c) The ten leading organizations, 
determined on the basis of overall patent-document numbers for iPSC inventions, show Kyoto 
University as the top iPSC patent holder (both for overall applications and granted patents). Despite 
the overall dominance of the United States, Japan has two institutions, Kyoto University and the 
National Institute of Advanced Industrial Science and Technology, in the top ten organizations for iPSC 
inventions, in addition to the Guangzhou Institutes of Biomedicine and Health in China and the Agency 
for Science, Technology and Research in Singapore. 


assessment of patents’ value to the field 
through citation analysis compared with the 
location of the top assignees (patent holders), 
the geographical hubs for iPSC research are 
Japan, Massachusetts, California, Wisconsin 
and the United Kingdom (Table 2, Table 3 
and Fig. 2). 

China and South Korea both have strong 
footing in the top five countries filing 
iPSC patents (Table 2 and Fig. 2). Their 
positioning is stronger for iPSCs than for 
PSCs or general stem cell patents. In terms 
of patent filing numbers, China appears to be 
the third strongest contributor to iPSC-related 
inventions, exceeding the European Union 
(EU). The EU has a similar patent activity level 
to South Korea (Fig. 2). This distinguishes the 
iPSC patent landscape. By contrast, the EU is 
the third highest contributor with regard to 
patents filed for embryonic stem cell discovery 
and stem cell research more generally. Although 
these geographical comparisons include more 
ancillary iPSC technologies and do not account 
for validity of the patents filed, they clearly 
indicate greater global participation than in other 
areas of stem cell research. 

After the United Kingdom, Germany is the 
top patent-filing country in the EU for all stem 
cells. For the iPSC patent landscape, the United 
Kingdom remains the strongest patenting EU 
member state. Germany is active in the PSC 
space but does not seem currently to have a 
leading presence in iPSC technology (Fig. 2). 
German institutions, however, are the second 
largest contributors, behind France, of regional 
EU patent applications. It is possible that a 
stronger presence has yet to emerge owing to the 
year-and-a-half time from filing to publication. 
Even in the broader pluripotent cell 
landscape, Kyoto University is the established 
leader among patent applicants. However, 
until as recently as 2012, the Wisconsin 
Alumni Research Foundation was the 
institution granted the most stem cell 
patents. Unsurprisingly, in the iPSC space, 
Kyoto University leads for total filed and 
granted patent applications. Unlike stem cell 
technologies in general, which are majority 
corporate-owned, iPSC technologies are still 
emerging and therefore remain dominated by 
public-sector R&D, with only 34% of patents 
owned by corporate entities (Fig. 3a). 


Patent distribution and patent thickets 
Patent portfolios remain small and thinly 
distributed among many entities: 66% 
of applicants own fewer than five patent 
applications, and 32% have one patent 
application (Fig. 3b). IP dominance over 
iPSC technologies is established by relatively 
small holdings of six to more than 21 filings 
in a portfolio (Fig. 3b). The total number of 
patents owned may be higher than shown, 
because an assignee may hold several patents 
worldwide for any single invention. Here, we 
restricted analysis to the number of inventions 
to determine an entity’s impact as well as 
portfolio diversity. 

The ten leading assignees and inventors at 
leading institutions (Fig. 3c) reveal a significant 
overall international contribution to the field, 
indicating a global dispersion of basic iPSC 
technologies. 

The growing disparity between granted 
and pending applications further clouds the 
certainty of which parties may validly claim 
(and license) which technologies. Regulatory 
and legal challenges have contributed to an 
increased delay in examination, particularly 
before the European Patent Office, although 
granted patents are still subject to challenge. 
Only 11% of patents are issued, and an 
overwhelming 89% of applications await 
examination (Fig. 4). 

This distribution pattern is typical of an 
emerging technology. However, thin patent 
distribution among industry also warrants 
concern for patent-thicket formation. Strong 
competition by many actors in the field can fuel 
developments, but it also indicates that the task 
of coordinating access to interrelated enabling 
technologies might require an onerous and 
complex licensing scheme. 

Despite the public sector’s dominance, 
a substantial number of smaller biotech 
companies occupy the other end of the filing 
spectrum. Many of these have filed only one 
iPSC-related patent. The 34% contribution 
from the private sector therefore constitutes 
a large number of companies holding a low 
number of patents (Fig. 3a). Further in-depth 
analysis of company profiles and collaborations 
would probably prove to be of interest. 

Figure 4 The percentage of patents granted globally is low and varies between countries. Only 11% 
of all filed iPSC-related patents have been granted, leaving 89% pending examination, adding to the 
uncertainty of who will hold rights in what technologies. The ratio of iPSC granted patents to pending 
applications for each of the leading five patenting nations and the EU (through its regional patent 
application process) by priority country is shown. 

This analysis does not reflect the 
licensing of portfolios to ‘exploitation 
companies’ that privately manage the 
licensing arrangements for a range of patent 
holders. By way of example, iPS Academia 
Japan (iPS AJ) is a subsidiary of Kyoto 
University that is privately funded and has 
widely sublicensed iPSC patent portfolios 
from several universities and research 
institutions in Japan and beyond, including 
Axiogenesis AG of Cologne, Germany, and 
Auckland UniServices Ltd. of New Zealand. 
Its total licensed patent portfolio includes 
an estimated 74 families (an estimated 
280 applications with 70 patents granted 
worldwide). iPS AJ manages IP for iPSC- 
related technologies, cellular products, 
drug discovery and cell therapy studies. 
All licenses are nonexclusive; those to 
universities are on a royalty-free basis, and 
those to industry are royalty bearing. It also 
actively fosters international alliances and 
collaborative R&D for therapeutic benefit. 


In this way, rather than relying on stretched 
technology transfer offices that may lack the 
expertise, private IP collectives could compete 
to attract portfolio management, consolidate 
portfolio holdings under one roof and improve 
licensing transparency. Another clear solution 
to negotiating a fragmented landscape is to 
‘bundle’ related rights to platform technologies 
through cross-licensing schemes. These 
cross-licensed rights are then available for 
license as a ‘package’ by a single provider. The 
management and expertise required for such 
patent bundling may still best be handled by a 
company or clearing house, however. 

The reasons for patenting an array of 
methods for the same general technology, such 
as upstream iPSC production, are multifactorial 
and do not always indicate a thicket. Inventive 
steps sometimes address different processes, 
stages or mechanisms of action that achieve the 
same function. Alternatively, many patents may 
seem to focus on the same specific technology 
in part because working with multiple cell 
types across different species presents unique 
challenges. The validity of independent claims 
to the same iPSC technologies is nonetheless 
becoming increasingly murky. A splintered 
approach to developing platform technologies 
is currently in progress and reflected by the 
rapidly increasing entanglement of IP among a 
web of assignees. Although the conditions for a 
patent thicket exist within the iPSC innovation 
ecosystem, there are a number of reasons why 
escalating numbers of dispersed patents may 
not signify the erection of an anti-commons 
obstacle. 


Next-generation technology 

The first methods of generating iPSCs involved 
retroviral delivery of reprogramming factors 
through random integration and were therefore 
deemed unsafe for therapeutic application. 
Recent patents claim substantially different 
methods of and materials for delivering 
reprogramming factors. For example, small- 
molecule reprogramming, as recently described 
in mouse cells, represents a breakthrough 
technology that may be unaffected by any patent 
thicket around reprogramming technologies. 
Unless held to be an obvious progression from 
previous knowledge, the technology described 
in these later patents should be able to operate 
freely with respect to the same technology— 
production of iPSCs from somatic cells. 


Assignee                                                          Number of citations for   Number of patents with
                                                                  most cited patent         >5 forward citations
Kyoto University                                                  104                       o)
University of California                                          g)                        2
General Hospital Corp.                                            20                        2
Harvard College                                                   20                        3
Whitehead Institute of Biomedical Research                        18                        5
Wisconsin Alumni Research Foundation                              S7/                       4
National Institute of Advanced Industrial Science and Technology  3                         0
Cellular Dynamics International                                   12                        3
Guangzhou Institutes of Biomedicine and Health                    2                         0
Advanced Cell Technology                                          iL)                       5)
How the field will develop, and whether 
next-generation technologies must 
license extensive portfolios or will avoid 
infringement of prior patents, remains to be 
seen. As discussed, the patent diaspora may 
merge naturally through the cross-licensing 
of readily available commercially bundled 
technologies. As therapies or products 
near market, industry-led mergers and 
acquisitions or strategic partnerships may 
also develop a coherent IP infrastructure. 
Consolidation of the IP landscape, through 
market trends and industry growth, for 
better-resourced innovation has yet to take 
shape. The possibilities suggest that concerns 
are premature and the iPSC landscape may 
merely reflect a normal early growth cycle 
within a dynamic industry. 

However, practices can be implemented 
to foster the incentives the patent system 
offers while protecting translation from 
patent mismanagement. Improved 
transparency through the creation of a 
centralized IP registry that would track 
current nonconfidential licensing practices 
would better inform any freedom-to-operate 
concerns. 


The wider context of iPSC technologies 
The validity and quality of individual claims 
asserted by patents are not assessed here. Even 
the most fundamental iPSC patents granted, 
first in 2008 to Yamanaka in Japan and then 
in 2010 to Kazuhiro Sakurada in the United 
Kingdom, have not escaped challenges 
to validity by patents held by Rudolf 
Jaenisch, which significantly predate 
them. Nonetheless, the patent landscape 
cannot be understood in isolation from its 
greater legal, scientific and regulatory context, 
in which unique challenges exist for the cell 
therapy industry as a whole. An appreciation 
of these issues is vital to the understanding of 
the complexity of claiming proprietary rights 
in living biomaterials from donors. 

A number of recent judicial decisions in 
the United States and Europe potentially 
limit the scope of IP rights claimed for stem 
cells, but iPSC-related inventions remain 
actively patent eligible. Notably, however, the 
EU awaits express judicial confirmation this 
year that iPSCs do not fall within its broad 
interpretation of ‘human embryo’ by inducing 
germ and other cells into an embryonic-like 
state. 

The US Supreme Court decision in 
Association for Molecular Pathology et al. v. 
Myriad Genetics, Inc., et al. invalidated patents 
on isolated DNA sequences, causing concern 
that patents claiming ownership of a donor’s 
isolated stem cells may be refused on similar 
‘product of nature’ grounds. A challenge to 
the broad ESC patents held by the Wisconsin 
Alumni Research Foundation was then 
reinvigorated in the United States. iPSCs 
are less susceptible to these patentability 
concerns. Although more recent methods do 
not rely on genetic manipulation, complex 
steps chemically manipulate the cells so 
that they function in a manner that does 
not occur in nature. Thus, more akin to 
recombinant DNA, iPSCs are likely to be 
patentable as both process and 
composition-of-matter (cell) claims, unless the latter is 
deemed legally excluded subject matter. 
Likewise, because iPSCs are not sourced from 
human embryos, they present fewer ethical 
challenges. The controversy surrounding 
human ESCs has confused and cast aspersions 
on iPSC patentability. Greater legal clarity, 
beyond Europe, is required to overcome 
misperceptions. 


IP barriers and solutions 

IP support currently available is likely to be 
insufficient for the large number of small 
biotech companies, for which the main issue 
may be initial stacked licensing fees. Typically, 
these smaller companies with undeveloped IP 
portfolios will not have the resources for big 
pharma-level corporate licensing schemes or 
litigation challenges from biotech goliaths. It 
is, however, anticipated that these issues are 
temporary until more iPSC technologies 
are ready for commercialization. As a field 
matures, industry partnerships and buyouts 
typically consolidate IP and provide necessary 
resources. 

Most collaborative efforts have addressed 
‘open science’ sharing of research materials 
and data but have neglected to address a 
need for better transparency in licensing 
practices. Although it is unlikely that patent 
holders will altruistically donate patents 
to a true open-source licensing collective, 
other commercially reasonable alternatives 
exist to foster the iPSC business community. 
Transparent commercial licensing practices 
could be established through the creation 
of online regional registries or databases, a 
global IP exchange or, more comprehensively, 
informal ‘clearinghouses’ for holders 
of stem cell-related patents to register 
ownership and nonconfidential trading 
activity. The clearinghouse model has 
been implemented in the public sector for 
agricultural biotechnology. New licensing 
models such as the ‘Easy Access IP’ initiative 
(http://www.easyaccessip.org.uk/) may be 
suitable for smaller biotech collaboration 
with universities. Other innovative life 
science platforms also exist for easy adaptation 
to create an IP registry and exchange market 
(for example, http://www.bioinnovit.com/). 

As the wider IP life-sciences landscape 
shifts to accommodate genomics and 
regenerative medicine, other forms of IP are 
surfacing as alternative sources of commercial 
value beyond patents. Trade secrets and access 
to corporate biobanks may become the IP 
concerns for the future of iPSC innovation 
and public health. 

The lengthy regulatory path to market 
for cell therapies may cause patents to 
expire before they can be commercially 
exploited. Trade secrets prevent the sharing 
of knowledge but circumvent patent expiry 
and uncertainty issues. Consequently, 
trade secrets on improved techniques and 
technical know-how may become the more 
viable alternative and are appearing as an 
IP instrument in current industry practice. 
Cellular Dynamics International, for example, 
is a leading assignee of iPSC technology and 
supplies iPSC lines. The company has adopted 
trade secrets as a vital part of its IP portfolio. 


Conclusions 

Unlike human embryonic stem cells, the 
pivotal impending legal concern for iPSC 
technology is not patentability. Instead, it is 
the potential formation of a patent thicket and 
mismanaged licensing practices in a field that 
has begun to privatize at an early preclinical 
phase. The likelihood of these potential 
challenges occurring is uncertain. Innovative 
IP licensing solutions, industry growth 
cycles and the promise of next-generation 
breakthroughs suggest that such issues should 
resolve themselves; however, this laissez-faire 
approach may come at too high a risk for the 
industry and for global health. 

Undeniably, the requisite conditions for 
patent-thicket formation exist. The complex 
landscape lends credibility to concerns that it 
is becoming increasingly difficult to navigate 
and negotiate access to core technologies from 
dispersed patent holders. This is exacerbated 
by the uncertain validity of overlapping claims in 
often still-pending patents. As the technology 
matures, this should be less of an issue. 

A pervasive multisector and multinational 
collaborative culture has already begun within 
the iPSC community. The damaging IP 
barriers that potentially loom in the near 
future could be avoided if stakeholders could 
equally embrace collaboration (beyond 
biobanks and data sharing) around IP. 
A collaborative approach to innovation 
could be achieved through well-managed 
IP-exploitation companies or through 
informal pooling of interrelated patents 
for certain core technologies through a 
clearinghouse. These actions would go far to 
remove the perceived threat of IP barriers and 
better serve public health. 

Encouraging low licensing fees cultivates 
mutually beneficial trading partners. 
Government and not-for-profit institutions 
are well positioned to lead the way for iPSC 
technologies in discouraging obstructive 
practices. This should include terms and 
conditions, such as for funding or material 
transfer agreements, which prohibit or 
create disincentives for exclusive licensing, 
trade secrets and unreasonable access fees 
for biobanks. The implications of current 
licensing practices are only now beginning 
to surface. Negotiating cumulative licensing 
schemes from splintered ownership will 
become increasingly difficult without 
collaborative efforts to consolidate and 
ensure an affordable, transparent approach 
to innovation. 
Can open-source R&D reinvigorate drug research? 


Bernard Munos 


Abstract | The low number of novel therapeutics approved by the US FDA in recent 
years continues to cause great concern about productivity and declining innovation. 
Can open-source drug research and development, using principles pioneered by 
the highly successful open-source software movement, help revive the industry? 


Open-source research, which started as a 
counterculture movement in the software 
industry 15 years ago, has since grown into a 
business model whose best-known product, 
Linux, has become a credible alternative to 
Microsoft’s Windows. Now, with biology 
increasingly becoming an information- 
orientated science, some have suggested 
that what worked for software might be part 
of the answer to the spiralling cost of drug 
R&D. With this in mind, this article exam- 
ines the relevance to pharmaceutical R&D 
of the open-source model developed by the 
software industry. In this context, open- 
source no longer refers to source code, but 
instead to the open origin of contributors. 
Open-source R&D has already made 
inroads into bioinformatics and research 
tools for drug hunters. However, key 
differences between software and biology, 
such as regulatory requirements, have 
limited its application to drug development. 
Nevertheless, in the past 5 years a new 
breed of organizations called public-private 
partnerships (PPPs) have adapted the 
open-source concept and combined it with 
outsourcing to create a new, low-cost 
business model, which they have applied 
with encouraging results to the discovery 
of new treatments for neglected diseases. 
Advances in data mining, visualization 
and networking now make it feasible to go 
one step further. It is possible to offer scien- 
tists a computerized toolbox that lets them 
harness the creativity of numerous volunteers 
to address the key questions that are holding 
back innovation. For example, what is the 
aetiology of a disease? What are the pathways 
involved? What are the better targets? Once 
these questions are answered, laboratory and 
clinical studies can be outsourced to institu- 
tions with the requisite capacity through the 
help of matchmaking software. 

The resulting model is a hybrid in which 
a part of R&D is open-sourced while the 
rest is outsourced. To function, however, 
it needs strong project leadership and 
expertise in the minutiae of drug R&D, 
which mostly exist in big pharmaceutical 
firms. This suggests that, far from being 
a threat to conventional drug R&D, 
open-source could be a way to leverage 
big pharma’s capabilities in order to tackle 
challenges that the blockbuster model can- 
not address economically, such as neglected 
diseases. As pharmacogenomics takes hold, 
it might also be a way to address market 
niches that cannot support blockbusters. 


A brief primer on open-source 
Open-source R&D is a novel approach to 
research that lets scientists join hands freely 
across organizations, disciplines and borders 
to solve problems in which they share an 
interest. The movement’s icon is Linux, the 
operating system started in the early 1990s by 
student Linus Torvalds, who used the nascent 
Internet to circulate it to fellow computer 
enthusiasts. Soon they were busy adding 
features and improving the code, with Torvalds 
overseeing the process. Fifteen years later, 
this grassroots experiment has blossomed 
into a new culture that is spreading to other 
disciplines. It is most prominent in computer 
software development, for which dedicated 
websites such as SourceForge or 
Subversion help over a million people 
collaborate on more than 100,000 projects. 
But other areas, such as life sciences, have 
spawned open-source initiatives of their own. 

The impetus to create open-source soft- 
ware often comes from developers looking for 
challenge. They agree on an attractive project, 
form a team and produce a ‘bare-bones’ pro- 
gram with basic functionality. Then, they offer 
it at no cost on the Internet under a public- 
domain license (there are many different 
types of open-source license; some, notably 
the ‘copyleft’ or General Public License (GPL), 
require those who download a program to 
share any improvements they make). If the 
project draws interest, others add features and 
post their code on the project’s webpage for 
fellow programmers to critique. New code of 
sufficient quality is added to the authorized 
version of the program. 

Open-source’s chief benefit is to 
cross-fertilize minds and tap creativity quickly, 
cheaply and on a scale that is beyond the reach 
of scientists working in the ‘ivory towers’ of 
academia or behind the ‘corporate moats’ 
of industry. Hollingsworth has shown that 
innovation spikes when diverse minds interact 
frequently in an unstructured manner. By 
drawing talent from all around the world, 
open-source research takes these dynamics 
to a new scale. And by making innovation 
immediately available to all, it speeds up the 
accumulation and application of knowledge. 

Outsiders are often puzzled by the open- 
source idea. Why would anyone work for 
free? Simply put, because some people value 
non-cash compensation more than money. 
They volunteer their expertise to satisfy ideal- 
ism or curiosity, seek new challenges, hone 
skills, build a reputation or enhance careers. 
Feldman quotes the example of Australian 
programmers who, within hours of Netscape’s 
release of its browser code, attached an 
‘add-on’ to enable secure internet transactions. 
No money changed hands, but the authors 
received respect from the programming com- 
munity and the satisfaction of turning out an 
elegant and useful piece of software. 

Companies are learning to use open- 
source to their advantage, and many now 
allow their employees to participate on com- 
pany time. They might use it to gain market 
share against entrenched competitors, or 
to entice developers to create applications 
for their product, possibly in the hope of 
turning it into a ‘platform’. Some of them 
have been quite successful at turning open- 
source into profits. Red Hat, for instance, 
has attained a US$5-billion market cap from 
selling support services for Linux. 


Can it work for drugs? 

If biomedical scientists could adapt the 
open-source model, it could make a huge 
difference to such projects as developing 
drugs for neglected diseases, for which 
needs are great but funds are scarce. Only 
10% of R&D resources are spent on illnesses 
that represent 90% of the burden of disease. 
Open-source drug R&D might not change 
that equation, but could make it possible 
to get much more from that 10%. 

There are, however, significant barriers to 
the deployment of open-source approaches 
to drug R&D. One is economic. All it takes 
to write open-source software is a laptop and 
an internet connection. With drug research, 
someone must pay for laboratory expenses 
and clinical trials. And the costs are high, at 
more than US$800 million for the discovery 
and development of a novel drug by most 
estimates. 

Research dynamics between the two indus- 
tries also differ. Software development does 
not have a discovery phase. Once the objective 
is set, programmers set to work and make 
steady progress towards their goal. By contrast, 
drug discovery cannot flourish until a certain 
amount of knowledge about the target disease 
has been accumulated. That knowledge 
acquisition can take years or decades, with no 
way to know at the outset whether the store 
of knowledge at hand is nearly sufficient or 
will require years of painstaking additional 
research before innovation can thrive. 

Software development is also simpler: 
it spans only a few disciplines and has no 
equivalent to clinical trials. For the most 
part, a single programmer can master all the 
skills needed to write a program from start 
to finish. By contrast, drug development 
requires coordination of multiple specialties 
with little overlap. Biomedical knowledge, 
which grows at the rate of 1,000 publications 
per day, must be peer-reviewed and repli- 
cated before it is accepted. All this is slow 
and enormously expensive. 

Drug R&D can go off-track more easily 
than software programming. Biologists 
can get mired in the complexity of biology 
without ever making much progress towards 
a drug — chemists handed the wrong target 
cannot do much good no matter how hard 
they try; inadequate toxicology can derail 
a compound late in development, or even 
after launch. One misstep along the way can 
render all downstream work useless. 

In contrast to drug developers, software 
publishers are lightly regulated. They do 
not need FDA approval. The quality standards 
they face are far less onerous than the 
minutiae of Good Laboratory Practice 
(GLP), Good Clinical Practice (GCP) and 
Good Manufacturing Practice (GMP). 
One sloppy programmer seldom jeopardizes 
the achievements of others, and errors can 
be patched without requiring the rewrite of 
the whole program. With drugs, one careless 
worker can compromise years of work 
costing tens of millions of dollars. 

Finally, the two industries follow different 
intellectual property regimes. Software is 
protected by copyrights that arise automati- 
cally as code is written, even if nothing is 
filed. Drug research is protected by patents 
that are costly to file and maintain, and 
for which meeting the legal standards that 
define innovation is much harder. 


Open-source biomedical research 
Early efforts. Despite these differences, the 
open-source idea has entered biomedical 
research. The first inroads were made in 
bioinformatics, as might have been expected. 
These efforts resulted in a collection of 
programs such as Biojava, BioPerl, BioPython, 
Bio-SPICE, BioRuby and Simple Molecular 
Mechanics for Proteins, and inspired other 
initiatives such as the Human Genome 
Project, the SNP Consortium, the Alliance 
for Cellular Signaling, BioForge, GMOD 
and Massachusetts Institute of Technology’s 
BioBricks (some of these have the transparency 
and feel of open-source, although the 
resources needed to get involved do not allow 
all volunteers to participate; however, we still 
call them ‘open-source’). 


An old idea. One could argue that there has 
long been an active, if invisible, collaborative 
process akin to open-source in drug 
development, as, for some diseases, half 
of all prescriptions are for off-label uses. 
Somehow, physicians share their ideas and 
experiences informally to uncover novel 
uses for existing medicines. For instance, 
oncologists routinely use drugs approved for 
one kind of cancer to treat other types. In a 
recent study, DeMonaco found that 59% of 
drug therapy innovations were discovered 
by practicing clinicians via field discovery. 
The way by which physicians uncover these 
new indications is quick and inexpensive 
compared with Phase III trials. From an eco- 
nomic and medical standpoint, there would 
be merit in exploiting these clinical observa- 
tions and sharing them with physicians as a 
complement to, or replacement for, some of 
the traditional clinical development. 


Public-private partnerships. Taking a different 
approach, a new kind of organization, known 
as the public-private partnership (PPP), has 
recently developed a clever virtual business 
model that emulates the collaborative features 
of the open-source concept. An example is 
the Medicines for Malaria Venture (MMV), 
which was established in 1999 to discover and 
develop new, affordable antimalarial drugs. 
Established as a nonprofit entity with a staff 
of only 13 people, it has assembled a portfolio 
of 19 projects ranging from discovery to 
Phase III (FIG. 1). 

Box 1 | The Medicines for Malaria Venture 

The Medicines for Malaria Venture (MMV) is run by a staff of 13. Its CEO reports to a Board of 12 
Directors who represent funding organizations. An Expert Scientific Advisory Committee, which 
includes chemists, biologists, clinicians, malariologists and drug development experts, advises on 
project selection and research strategy. The Management Team’s responsibilities are to: 

• Encourage the submission of research proposals 
• Select proposals and negotiate with partners 
• Set up project-management teams and monitor progress 
• Organize manufacturing and marketing 
• Earn appropriate returns from marketed products 
• Raise funds 
• Communicate with government agencies 

Backers 

• Gates Foundation 
• BHP Billiton 
• ExxonMobil 
• Global Forum for Health Research 
• International Federation of Pharmaceutical Manufacturers Associations 
• Netherlands Minister for Development Cooperation 
• Rockefeller Foundation 
• Swiss Agency for Development and Cooperation 
• United Kingdom Department for International Development 
• United States Agency for International Development 
• World Bank 
• Wellcome Trust 
• World Health Organization 

MMV gets its projects through open 
calls — anyone with an idea can contribute. 
An Expert Scientific Advisory Committee 
reviews the submissions and selects the 
projects that will be funded. Each is managed 
by a project manager who outsources the 
R&D to a network of 300 scientists at 40 
institutions (universities, big pharma, biotechs 
and research institutes). Funding comes from 
public and philanthropic partners (BOX 1). 
After each step, the Scientific Advisory 
Committee reviews the data and decides 
whether to proceed or terminate the project. 
MMV’s cumulative spend from 2000 through 
2005 is about US$100 million, 90% of which 
funded actual research. MMV plans to out- 
source manufacturing to low-cost partners, 
sell drugs at cost to developing countries, and 
market them through partners in developed 
markets (for example, to treat travellers). 
Its alliance with GlaxoSmithKline supports 
25 scientists funded equally by the partners. 

The Initiative on Public-Private 
Partnerships for Health reckons that there 
are about 24 PPPs engaged in drug and 
vaccine R&D (TABLE 1). Most of them were 
created in the past 7 years and share a com- 
mon profile’’. First, they focus on neglected 
diseases. Second, they operate as virtual drug 
companies, with a small staff getting project 
ideas from outside, vetting them through a 
committee of experts and outsourcing R&D 
to a network of institutions. Third, they man- 
age growing portfolios of projects ranging 
from discovery through to Phase III trials. 
Fourth, they have been able to function on 
lean budgets with a cumulative spending 
that seldom exceeds US$50 million. This 
makes them attractive vehicles to fund 
research in areas that are not economical for 
traditional drug R&D. 

By the end of 2005, PPPs had attracted 
funding in excess of US$1.5 billion. 
Foundations have given about US$1.15 billion 
(with the Gates Foundation alone contributing 
more than US$950 million), governments 
US$244 million and private entities US$36 
million. In addition, donors have committed 
another US$3.5 billion which will be 
disbursed as needed by The Global Fund to 
Fight AIDS, Tuberculosis and Malaria. 
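
The funding figures quoted above can be tallied in a few lines. The sketch below uses only the numbers given in the text (in US$ million); the variable names are illustrative. The itemized sources add up to roughly US$1.43 billion, a little under the "in excess of US$1.5 billion" headline, which may reflect rounding or sources not itemized here. The headline figure plus the Global Fund commitments gives the US$5 billion total cited later under Funding.

```python
# Figures quoted in the text, in US$ million. Names are illustrative only.
disbursed_by_source = {
    "foundations": 1150,       # "about US$1.15 billion"
    "governments": 244,
    "private entities": 36,
}
headline_disbursed = 1500          # "in excess of US$1.5 billion"
committed_via_global_fund = 3500   # "another US$3.5 billion"

# Sum of the itemized sources, and the overall committed total.
itemized_total = sum(disbursed_by_source.values())
total_committed = headline_disbursed + committed_via_global_fund

print(f"Itemized disbursements: US${itemized_total} million")
print(f"Headline plus Global Fund commitments: US${total_committed} million")

# Share of the itemized funding contributed by each type of donor.
for source, amount in disbursed_by_source.items():
    print(f"{source}: {amount / itemized_total:.0%} of itemized funding")
```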


Open-source versus alliance networks. It can 
be argued that the 25,000 alliances wrought 
by the 8,000 pharma and biotech companies 
over the past 15 years add up to a vast open- 
innovation system that mimics the collabo- 
rative features of the open-source model. 
Some scholars have countered, however, that 
alliances are less effective than open-source 
research at promoting innovation. This is 
because open-source networks are richer in 
‘weak links’ (loose relationships), whereas 
alliances pride themselves on the strength 
of the connections between partners. 
DeBresson has shown that weak links bring 
novel ideas into the fray whereas strong links 
tend to reinforce orthodoxies. 


PPPs and big pharmas. TABLE 2 lists some of 
the projects and organizations coordinated 
by PPPs. As can be seen, GlaxoSmithKline 
features prominently, with Bristol-Myers 
Squibb, Novartis, Bayer, Sanofi-Aventis and 
Ranbaxy involved to a lesser extent. 


Lessons learned 

PPPs have advantages and drawbacks 
compared with traditional R&D". Advantages 
include the following. 
Table 1 | Public-private partnerships engaged in drug and vaccine development 

Name | Focus | Year created 
Aeras Global TB Vaccine Foundation | Tuberculosis | 1997 
BIO Ventures for Global Health | Biotech drugs for neglected diseases | 2004 
Consortium for Industrial Collaboration in Contraceptive Research | Development of new contraceptives | 1995 
Contraceptive Research and Development | Improving reproductive health in developing countries | 1986 
Dengue Vaccine Project | Dengue fever | 1989 
Drugs for Neglected Diseases Initiative | Sleeping sickness, visceral leishmaniasis, Chagas disease | 2003 
European Malaria Vaccine Initiative | Malaria | 1998 
Gates Foundation-UNC Partnership for Development of New Drugs | African trypanosomiasis, leishmaniasis | 2000 
Global Alliance for TB | Tuberculosis | 2000 
Global Microbicide Project | New microbicides for women | 2000 
Human Hookworm Vaccine Initiative | Hookworm | 2000 
Infectious Disease Research Institute | Tuberculosis, leishmaniasis, Chagas disease, malaria, leprosy and Buruli ulcer | 1993 
Institute for One World Health | Visceral leishmaniasis, cutaneous leishmaniasis, Chagas disease, paediatric secretory diarrhoea | 2000 
International AIDS Vaccine Initiative | AIDS | 1996 
International Partnership for Microbicides | HIV | 2002 
Japanese Pharmaceutical, Ministry of Health, WHO Malaria Drug Partnership | Malaria | 1999 
Lapdap Antimalarial Product Development | Malaria | 1998 
Lassa Fever Initiative | Lassa fever | 2001 
Malaria Vaccine Initiative | Malaria | 1999 
Medicines for Malaria Venture | Malaria | 1999 
Meningitis | Meningitis | 2001 
Microbicides Development Programme | HIV | 2001 
Pediatric Dengue Vaccine Initiative | Dengue | 2001 
PneumoADIP | Pneumococcal vaccines | 2004 

UNC, University of North Carolina; WHO, World Health Organization. 


Agility. Virtual R&D makes it easier to 
terminate projects that no longer look 
promising. The project manager does not 
have to deal with entrenched advocates 
manoeuvring to save their project or move 
it underground. 


Creativity. PPPs enable experts from dif- 
ferent countries, specialties and styles of 
thought to leverage each other’s ideas. They 
harness the problem-solving skills of a much 
greater population than is typically available 
to traditional research organizations. 


Focus. PPPs focus on one or a few diseases. 
This helps them build deep expertise for better 
decisions (for example, target selection). 


Risk sharing. The open-innovation 
model of PPPs makes it easier for scientists 
to collaborate on pre-commercial research 
such as biomarkers or cell signalling. 


Affordability. PPPs lower the critical mass 
required to be a pharmaceutical company. By 
leveraging external expertise and capabilities, 
they allow small organizations to do much of 
what was once the domain of large companies. 


Impact. PPPs engage scientists in developing 
nations who have first-hand experience in 
many neglected diseases. It helps them build 
their clinical research capacity, which in turn 
leverages the effectiveness of their public 
health systems. 


Speed. Lean PPPs can decide quickly, partly 
because they do not have layers of committees 
to satisfy. In addition, because they tap their 
partners’ unused capacity, they can advance 
swiftly as there is often a qualified laboratory 
somewhere that can do the work without 
having to wait in someone else's queue. 


There are also some disadvantages to PPPs, 
which include the following. 


Funding. US$5 billion has been committed 
to PPPs ($1.5 billion disbursed). However, 
despite the thriftiness of PPPs, there is 
concern that these funds will be stretched 
as more projects move into late, expensive 
clinical development. 


Sustainability. PPPs have not demon- 
strated the sustainability of their business 
model. Some of their projects come from 
companies that had shelved them because 
of insufficient commercial prospects. To 
survive, PPPs will need to replenish their 
portfolios. There are also worries that, in 
some areas of science, the pool of contribu- 
tors might be too thin to perform the work 
that must be done. 


TABLE 3 shows that the PPP R&D model 
has worked reasonably well. Some of this 
success comes from targeting low-hanging 
fruit in diseases that have long been 
neglected, but it also suggests that the PPP 
model can be a potent tool in finding new 
cures. Whether the PPP business model 
becomes a transformational force or 
remains a non-threatening niche depends 
on how it ultimately performs against 
traditional pharmaceutical R&D. To 
succeed, it must go beyond tools and soft- 
ware and tackle large projects where it will 
rival the big firms that are helping it today. 
Yet, this rivalry need not be a zero-sum 
game. On the contrary, there is a place for 
collaborative and proprietary research in 
drug R&D, just as in software. If open-source 
drug R&D takes hold, what will 
probably emerge is not the replacement of 
one model by another, but an ecology in 
which big pharma, biotech and collabora- 
tive research compete and collaborate at 
the same time, feeding off each other 
synergistically, while moving towards 
therapies along their own distinctive paths. 


A template for open-source drug R&D 
Can the PPP model succeed beyond 
neglected diseases? To answer this, it helps to 
break down drug R&D into knowledge-based 
activities and rule-based tasks. 
Knowledge-based work requires lots 
of intelligence and intuition, but little 
infrastructure. Examples include identifying 
targets, understanding metabolic networks, 
and designing clinical trials or computer- 
ized disease models. It is about scientists 
leveraging each other’s ideas, and using 
tools to gain deeper insights that might lead 
to breakthroughs. This work is ideally 
suited to the open-source model. 

Rule-based work requires physical assets 
(laboratories, equipment, patients and so on) 
and money. It is tightly scripted and must 
conform to rigid regulatory requirements. 
It is about organization, discipline and 
implementation. Examples include toxicol- 
ogy studies, Chemistry Manufacturing and 
Controls (CMC) studies, and the conduct 
of clinical trials. Rule-based work is ideally 
suited to outsourcing, and much of it is 
already outsourced to contract research 
organizations. 

This division of labour suggests a busi- 
ness model template in which part of the 
R&D value chain is open-sourced, while 
the rest is outsourced, with the following 
features. 


Template features: operating principles 
Open-sourcing. The open-source part of 
our model should allow anyone who can 
contribute to join. Volunteers should be 
able to log on to a website, find the page(s) 
that matches their area of expertise, peruse 
challenges to be solved, review others’ con- 
tributions, download computerized tools 
and start working towards contributions of 
their own. As they progress, they can pub- 
lish their findings in scientific journals and 
discuss their insights in on-line forums. 
Over time, the better ones will gain authority 
and become the de facto leaders of their 
open-source community. 


Outsourcing. Work to be outsourced 
should be posted on a website for all to see. 
Scientists and organizations qualified for the 
job can bid, and the sponsor picks the best 
candidate for each task. 


Template features: procedures 
Governance. Three decision-making bodies 
provide leadership and guidance: the Board 
of Directors, the Steering Committee 
and the Scientific Advisory Committee. 
The Board of Directors includes senior 
executives and outsiders who represent 
shareholders and stakeholders. It approves 
strategy and ensures that management 
performance is consistent with the organization’s 
mission. The Steering Committee 
is a group of senior executives that rules 
on important operational issues such 
as fundraising, budgets, project funding, 
key hires and selection of partners. 
It also approves recommendations from 
the Scientific Advisory Committee. The 
Scientific Advisory Committee (SAC) is a 
group of external experts from academia 
and industry. It sets R&D strategy, proposes 
new projects, reviews existing ones and 
recommends termination of those that no 
longer deserve support. 


Table 2 | Public-private partnerships and their partners 

Project | Industry partner | University/public health partner | Other PPP 

Medicines for Malaria Venture 
Improved 4-aminoquinoline | GlaxoSmithKline | University of Liverpool | 
Farnesyl transferase inhibitors | Bristol-Myers Squibb | University of Washington | 
Manzamine derivatives | | University of Mississippi | 
Cysteine protease inhibitors | GlaxoSmithKline | WESE | 
Fatty acid biosynthesis inhibition | | Texas A&M, A. Einstein, Jacobus | 
Pyridone | GlaxoSmithKline | | 
New di-cationic molecules | Immtech | University of North Carolina | 
Dihydrofolate reductase inhibition | | Biotec Thailand | 
Artesunate derivatives | GlaxoSmithKline, Shin Poong | | TDR, DNDi 
Artemisone | Bayer | University of Hong Kong | 
Synthetic peroxide | Ranbaxy | University of Nebraska | 
Intravenous artesunate | | Walter Reed | 
Coartem in infants | Novartis | | TDR 

TB Alliance 
Pyridones and quinolizines | | Taejon, Yonsei | 
Isoniazid analogue | | Wellesley College | 
PA-824 | | NIH, Johns Hopkins | 
Moxifloxacin | Bayer | CDC, Johns Hopkins | 

Institute for One World Health 
Paromomycin | | | TDR 
Azole | | Yale | 

TDR 
Miltefosine | Zentaris | | 
Oral eflornithine | Aventis | | 

CDC, Centers for Disease Control and Prevention; DNDi, Drugs for Neglected Diseases Initiative; NIH, 
National Institutes of Health; TDR, UNICEF-UNDP-World Bank-WHO Special Programme for Research 
and Training in Tropical Diseases. 


Scope. This template calls for focusing 
on single diseases or related illnesses. 

An organization working on unrelated 
diseases should establish separate websites 
for each one. 


Projects origination. There is a permanent 
open call for new projects. Scientists are 
invited to submit ideas online for review by 
the SAC. 


Portfolio management. The SAC is 
responsible for maintaining an adequate and 
balanced pipeline of projects. 


Project management. Each project is managed 
by a Project Team led by a member 
of the organization, and staffed by external 
experts in drug discovery, clinical research 
and regulation. The Project Team is responsible 
for developing the budget and timeline, 
overseeing outsourced tasks and ensuring 
compliance with GxP. One of its crucial 
duties is selecting what will be open-sourced 
and what will be outsourced. The project 
leader is accountable for generating the data 
used to decide whether to fund the next step. 
Commitment to a project is limited to the 
current step, until the data warrants commit- 
ting funds for the next one. Open-sourced 
tasks are posted on the project’s website, 
each on its own page, and outsourced ones 
are posted on a companion matchmaking 
website such as Innocentive or Scienteur. 
Outsourcing bids are reviewed by the Project 
Team, which issues recommendations 
to the SAC. 


Table 3 | Public-private partnerships lower the critical mass required to discover and develop new cures 

Organization | Focus | Staff | Pipeline: number of projects | Pipeline: discovery 
Medicines for Malaria Venture | Malaria | 13 | 19 | 12 
TB Alliance | Tuberculosis | 18 | 12 | 
Drugs for Neglected Diseases Initiative | Trypanosomiasis, visceral leishmaniasis, Chagas disease | 36 | 20 | 9 
OneWorld Health | Leishmaniasis, malaria, Chagas disease, diarrhoeal diseases | 40 | 5 | 1 
International AIDS Vaccine Initiative | HIV/AIDS | 169 | 6 | - 
Malaria Vaccine Initiative | Malaria | 32 | 10 | 4 

Source: Annual reports. PK, pharmacokinetics. 
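
The step-by-step commitment described under Project management (funds are committed only for the current step, and the next step is funded only if the data warrant it) can be sketched as a small state machine. This is a minimal illustration, not a description of any PPP's actual system: the `Project` class, the `review` function and the budget figures are all hypothetical, and the stage names are borrowed from the stages listed in Table 4.

```python
from dataclasses import dataclass, field

# Stage names follow Table 4; the gating logic itself is an assumption.
STAGES = ["Discovery and PK", "Phase I", "Phase II", "Phase III"]


@dataclass
class Project:
    name: str
    stage_index: int = 0
    committed: float = 0.0          # funds committed so far (US$ million)
    terminated: bool = False
    history: list = field(default_factory=list)

    @property
    def stage(self) -> str:
        return STAGES[self.stage_index]


def review(project: Project, data_supports_next_step: bool,
           next_step_budget: float) -> Project:
    """Hypothetical review at the end of a step: fund the next step or stop."""
    if project.terminated:
        raise ValueError(f"{project.name} has already been terminated")
    if not data_supports_next_step:
        # No further funds are committed once the data stop supporting the project.
        project.terminated = True
        project.history.append((project.stage, "terminated"))
        return project
    if project.stage_index < len(STAGES) - 1:
        # Commitment extends only to the next step, never beyond it.
        project.stage_index += 1
        project.committed += next_step_budget
        project.history.append((project.stage, "funded"))
    return project


p = Project("hypothetical antimalarial")
review(p, data_supports_next_step=True, next_step_budget=1.6)
review(p, data_supports_next_step=False, next_step_budget=3.4)
print(p.stage, p.committed, p.terminated)
```

Because each review commits funds for one step only, a terminated project never accumulates spending beyond the step in which the data turned against it.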


Intellectual property ownership. There is 
often a misperception that open-source 
initiatives are hostile to patents and bent 
on putting discoveries in the public domain. 
The reality is more nuanced. Most open- 
source activities occur at a pre-commercial 
R&D stage, when the ideas and hypotheses 
debated fall short of the legal standards that 
define inventions in patent law. They are an 
on-going scientific conversation that can be 
likened to a global instant-messaging system 
linking scientists interested in a topic. In that 
sense, open-source is no more threatening 
to patents than other forms of scientific 
publishing. A scientist who engages in that 
conversation and comes up with an idea that 
can lead to a patentable invention will need 
to exercise caution with disclosures until the 
invention has been reduced to practice and 
patent applications have been filed, just as 
would be necessary in a traditional research 
setting. It is generally accepted that open 
communication promotes advancement 
of science, but needs to be balanced by the 
need to protect the rights of inventors. 
The same applies to open-source activities. 


Template features: tools 

The discovery toolbox. As of February 2006, 
349 genomes have been published and 
another 1,575 are being sequenced. A new 
generation of smart, computerized tools 

is becoming available to mine data, comb 
the literature, map metabolic networks, 
perform in silico modelling, visualize bind- 
ing sites, identify chemical leads, design 
molecules and predict toxicity. These tools 
should be packaged into a convenient 
toolbox, together with access to major 
databases, and offered to volunteers willing 
to contribute their expertise. 


Outsourcing software. Several programs 
already exist to match projects with talent 
and capacity. Two examples are Scienteur, 
a free e-marketplace that allows companies 
to post tasks, and experts to register their 
skills, and Innocentive, an online problem- 
solving tool that lets a company post a 
challenge with a reward: whoever finds the 
solution gets the money. 


Table 4 | R&D costs for public-private partnerships (US$ million) 

Stage | MMV | TB Alliance | DNDi | IAVI | Big Pharma 
Discovery and PK | 8.3 | 18.6 | 16.2 | 20.0 | 26.0 
Phase I | 1.6 | 0.6 | Unpublished | 2.0 | 15.2 
Phase II | [illegible] | 3.4 | Unpublished | 5.0 | 23.5 
Phase III | 9.5 | 22.6 | Unpublished | 30.0 | 86.3 
Total clinical | [illegible] | 26.6 | 24.2 | 37.0 | 125.0 

Source: REF. 19. DNDi, Drugs for Neglected Diseases Initiative; IAVI, International AIDS Vaccine Initiative; 
MMV, Medicines for Malaria Venture. 


Table 3 (continued) | Pipeline by stage and cumulative spending 

Organization | Pipeline: PK | Pipeline: clinical | Cumulative spending through 2005 (US$ million) 
Medicines for Malaria Venture | 1 | 6 | 103 
TB Alliance | 3 | | 20 
Drugs for Neglected Diseases Initiative | 4 | 7 | 20 
OneWorld Health | 3 | 1 | 2 
International AIDS Vaccine Initiative | 1 | 5 | 120 
Malaria Vaccine Initiative | 1 | 4 | ? 


Template features: costs 

PPPs have been able to function on very low 
budgets for several reasons (TABLE 4). First, 
they have few people, low overhead costs 
and no fixed assets. They rely on someone 
else’s unused capacity, and the market seems 
to price such capacity at marginal instead 
of full cost. Second, they outsource much of 
their work where it is cheaper to do so and 
do most of their trials in developing coun- 
tries. Third, they concentrate on infectious 
diseases for which costs are lower. Fourth, 
they receive in-kind donations. 
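
As a rough check on the scale of the cost difference, the clinical-stage figures in Table 4 can be summed and compared with the big-pharma benchmark. The sketch below is illustrative only: it uses just the two PPP columns whose phase-by-phase figures are fully legible in this copy (TB Alliance and IAVI), and the helper name `clinical_total` is an assumption of this example.

```python
# Per-phase clinical costs (US$ million) as printed in Table 4.
phase_costs = {
    "TB Alliance": {"Phase I": 0.6, "Phase II": 3.4, "Phase III": 22.6},
    "IAVI": {"Phase I": 2.0, "Phase II": 5.0, "Phase III": 30.0},
}
# "Total clinical" row of Table 4, used to cross-check the sums.
reported_total_clinical = {"TB Alliance": 26.6, "IAVI": 37.0}
BIG_PHARMA_TOTAL_CLINICAL = 125.0


def clinical_total(costs: dict) -> float:
    """Sum the per-phase costs, rounded to one decimal like the table."""
    return round(sum(costs.values()), 1)


for org, costs in phase_costs.items():
    total = clinical_total(costs)
    matches = total == reported_total_clinical[org]
    ratio = BIG_PHARMA_TOTAL_CLINICAL / total
    print(f"{org}: phases sum to {total} (matches table: {matches}); "
          f"big pharma spends about {ratio:.1f}x as much on clinical work")
```

On these figures, the big-pharma clinical total is several times that of either PPP, which is consistent with the reasons for low budgets given in the text; the comparison says nothing about differences in trial size or disease area.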


Will it work? 

Despite the promise of open-source drug 
R&D, both its pioneers, and the veterans 

of open-source software, point to several 
potentially troublesome issues that could 
affect the success of the open-source model. 


Availability of talent. Typical open-source 
projects do not require a large number of 
contributors. Data from the software indus- 
try suggests that the ideal number ranges 
from 6 to 20 people. Yet much of the drug 
R&D expertise resides in an industry that 
has a strong proprietary culture. Employees 
are routinely asked to assign their intellectual 
output, including that created on their own 
time, to their employers’’. This could stifle 
talent supply in key areas. Two developments, 
however, might give open-source drug R&D 
the permanent talent pool it needs. First, 
thousands of highly trained pharmaceutical 
scientists are nearing retirement and might 
welcome the opportunity to put their skills 
to good use. Second, drug companies might 
be persuaded to ease restrictions on their 
employees’ involvement. There is indeed little 
conflict of interest in being a cancer scientist 
by day and an anthrax researcher at night, 
and firms might gain valuable goodwill from 
letting employees seek cures for diseases in 
which they have no interest. 


Availability of data and standards. Open- 
source scientists cannot accomplish much 
unless they can access data. Biological data 
is plentiful and getting richer, with terabytes 
of genomic and metabolic data continuously 
being added to the pool. Chemical and 
structural data, on the other hand, are more 
scarce. In addition, the formats used to 
handle these data are still evolving. Biologists 
use a reasonably small number of them, but 
chemists are further from such consensus. 
Both the lack of standards and the scarcity of 
data in certain areas can cause problematic 
choke points in an open-source R&D effort. 


Availability of tools. Open-source scientists 
need open-source tools to practise their 
craft. Until recently, such tools were plentiful 
in bioinformatics, but less so in chemistry, 
which has long been dominated by com- 
mercial software. This is changing. The 2004 
launch of PubChem has brought online a 
powerful suite of tools that allows scientists 
to connect chemical information with 
biomedical research and clinical informa- 
tion in an unprecedented way. Non-profit 
scientists can now access small-molecule 
high-throughput screening, chemistry and 
informatics on a scale previously available 
only to industry. They can even get grants to 
turn their online discoveries into assays for 
high-throughput screening. Other tools 
such as eMolecules, Jmol or the Chemistry 
Development Kit are adding powerful 
chemical search and visualization capabilities 
to the open-source scientist’s toolbox. 


Intellectual leadership. Just as putting ingre- 
dients into a vat does not necessarily cause 
them to react, connecting smart people online 
does not guarantee they will produce anything 
valuable. In both cases, a catalyst is needed. 
For open-source drug R&D, the presence of 
a subgroup of highly innovative contributors 
who can tune in to the on-going conversation 
and fuel it with their own creative insights acts 
as such a catalyst. Without it, the conversation 
could remain shallow and fizzle out. 


Momentum. Enticing people to join is a 
challenge. A good website helps, but it’s not 
enough. As Darren Carroll, former CEO of 
Innocentive, puts it, “If you build it, they will 
not come!” It takes a sustained effort to get the 
word out and build trust with stakeholders. 
It also takes a leader who can connect with 
people, understand their motivation and foster 
trust. Linux attracts thousands of contributors 
because they identify with Torvalds’ ideals and 
trust him to do the right thing. Open-source 
drug R&D must build such leaders. 


Web interface. The design of the project's 
website is crucial. It must be engaging and 
appeal to visitors’ curiosity. They must be 
able to quickly find the pages that match 
their interests, download the toolbox, and be 
‘up-and-playing’ in minutes. 


Quality assurance/quality control. When 
something as complex as drug R&D gets 
parcelled out around the world, quality 
assurance can become an issue. Oversight, 
due-diligence, audits, good practices and 
prior experience can be used to ensure quality. 
International Organization for Standardization 
standards could also help in the future. 


Selectivity. Not all projects will be equally 
suitable. Cancer might draw contributors, 
but hair loss might not. 


Conclusion: a new ecology of drug R&D? 
Is there still room for big pharma in open- 
source R&D? One must stress that ‘virtual’ 
does not mean ‘leaderless’. To succeed, 
open-source R&D will need deep expertise 
in the minutiae of drug R&D, which today 
resides overwhelmingly in the pharma- 
ceutical industry. There might be many 
volunteers, but they must be shepherded 
towards a goal. Such stewardship is a core 
competency of pharmaceutical companies. 
Our model is not a substitute for them, but 
a way to leverage their capabilities to tackle 
unmet medical needs, such as the diseases of 
poverty, orphan diseases and niche markets. 
Pharmaceutical companies stand to gain 
from co-opting the open-source model and 
allowing it to flourish in ‘coopetition’ with 
traditional R&D, to handle the diseases or 
R&D steps for which it is best suited. 
SourceForge: http://sourceforge.net/ 

BioForge: www.bioforge.net 

PubChem: http://pubchem.ncbi.nlm.nih.gov 

eMolecules: www.emolecules.com 

Jmol: http://jmol.sourceforge.net 

CDK: http://almost.cubic.uni-koeln.de/cdk 

Emboss: http://emboss.sourceforge.net/ 

Medicines for Malaria Venture: www.mmv.org 

The Initiative on Public-Private Partnerships for Health: 
www.ippph.org 

Global Fund to Fight AIDS, Tuberculosis and Malaria: 
www.theglobalfund.org 

Aeras, Global TB Vaccine Foundation: www.aeras.org 
Drugs for Neglected Diseases Initiative: www.dndi.org 
Global Alliance for TB: www.tballiance.org 

Institute for One World Health: www.oneworldhealth.org 
International AIDS Vaccine Initiative: www.iavi.org 
Malaria Vaccine Initiative: www.malariavaccine.org 
Bring out your dead 


EDITORIAL 


Despite recent progress, only a fraction of the drug industry’s shelved compounds are shared with the research 
community. Could online collaborative research offer a solution? 


Failure, punctuated by the odd success, is an integral part of drug 
discovery and development. And industry is very vocal about the 
burden of its failed compounds in inflating the cost of bringing a drug to 
market—now purportedly $2.56 billion according to the latest Tufts study. 
Industry is less forthcoming, however, when disclosing the thousands of 
‘failed’ drug assets to the wider research community. In recent years, both 
the US National Center for Advancing Translational Sciences (NCATS) 
and the UK Medical Research Council (MRC) have made progress in 
coaxing de-prioritized compounds out of company vaults, with the latter 
announcing last month it had brought together “the world’s largest collection 
of de-prioritized compounds”, 68 in all. But for drug repurposing 
of failed compounds to truly realize its potential, additional mechanisms 
need to be found that incentivize both large and small companies to release 
data so that they can be accessed and searched by the crowd. 

Drug repurposing, or repositioning, is nothing new. Historically, it has 
often been a serendipitous process in which chance clinical observations 
suggest new indications for an approved drug or drug candidate. In recent 
years, compounds stalled in drug company pipelines due to lack of efficacy 
have received particular attention for repurposing. Such compounds are 
useful because they already have extensive information on safety and effi- 
cacy. Indeed, according to the Institute of Medicine, repurposed drugs can 
be approved faster (within 3-6 years of program initiation), at as little as 
~60% the cost and with three times lower attrition rates than a drug from 
a traditional discovery program. 

Two flagship initiatives that crowdsource researcher ideas for new 
uses are the NCATS’ Discovering New Therapeutic Uses for Existing 
Molecules program and the MRC’s Industry Asset Sharing Initiative. 
Key facets of either program are the provision of boilerplate collabora- 
tive research agreements, which streamline the legal and administrative 
burden between participating researchers and companies, and an open 
crowdsourcing application process followed by a closed second stage 
governed by cooperative agreements. Participants can apply for new use 
intellectual property (IP), with the companies donating the compounds 
getting first right of refusal. 

Thus far, NCATS and MRC have succeeded in engaging eight compa- 
nies from the entire industry. Although it is a promising start, it represents 
a tiny fraction of all discontinued programs. 

Uptake has been slow for several reasons. First, the drug industry is con- 
servative, particularly when repurposing initiatives might throw up safety 
signals about lucrative drugs in portfolios. Second, company insiders may 
dismiss crowdsourcing expertise—after all, they ‘know’ their compounds 
best (the NIH ‘not-invented-here’ problem). And third, sharing com- 
pounds and data for crowdsourcing has significant resource implications. 

To work with NCATS or MRC, a company must rally all the relevant 
data on a discontinued compound; must locate internal champions with 
relevant expertise (who may be working on another asset or have left the 
company); must appoint an individual to oversee the program (who would 
otherwise be working on another project in the organization); and, most 
importantly, must manufacture sufficient quantities of the compound at 
clinical grade to supply a trial. Industry insiders estimate these direct and 
‘opportunity’ costs may total up to $1 million per compound. 

Clearly, this is prohibitive for many companies, especially the majority 
of small-to-medium-sized enterprises. Even if their overheads are less, 
most biotech companies have insufficient time, staffing and resources to 
participate in an NCATS/MRC collaboration. For companies that run out 
of funding, assets and IP are often left in limbo. 

One alternative to the NCATS/MRC programs would be to create an 
online platform where users could upload data about shelved compounds 
in return for royalty options, milestone payments, and exclusive and non- 
exclusive licenses if compounds are taken forward. These online solu- 
tions could provide collaborative environments in which drug-related data 
would be uploaded, linking out to repositories like PubChem or ChEMBL. 
Researchers could either work in open environments to discuss ideas, 
organize research, and network or collaborate in private environments 
where proprietary information would be shared with a defined group. 

The question is, who would build such a resource? 

One thought is the European Union-funded Innovative Medicines 
Initiative (IMI). IMI has the authority, industry/academic contacts, and 
financial muscle to put the technology together in short order. Failing this, 
several internet-based collaborative platforms might also moonlight for 
this purpose. For example, Collaborative Drug Discovery's (CDD) ‘Vault’ 
cloud-based offering to mine biological and chemical data has securely 
hosted over 250,000 user logins for 10 years now; Cures Within Reach is 
launching the CureAccelerator platform later this year, which has similar 
capabilities and the ability to network with funders. 

The movement of drug discovery from silos in companies and other 
organizations to online platforms that enable collaborations with research- 
ers across the globe is not going to happen overnight. But it is coming. 

It will be enabled by the increasing ability to integrate molecular and 
structural data about compounds, data from electronic medical records 
and high-throughput large-scale phenotyping and genotyping technolo- 
gies. It will be facilitated by greater transparency in disclosing clinical data 
on drugs, exemplified by the European Medicines Agency’s decision to 
publish from this month onwards complete clinical study reports for all 
marketing authorization applications. And it will be driven by the recog- 
nition that crowdsourcing the global research community can invigorate 
drug development with new ideas. 

The upshot will be that collaborative drug discovery will become acces- 
sible to all companies, large or small, across the industry. If that means 
failures can be shared more widely, resources diverted from making the 
same mistakes and a greater number of failed compounds turned into 
successes, the sooner the better. 
Corrected after print 8 January 2015. 
ERRATA AND CORRIGENDA 


Erratum: Reinventing tech transfer 


Brady Huggett 
Nat. Biotechnol. 32, 1184-1191 (2014); published online 5 December 2014; corrected after print 9 December 2014 


In the version of this article initially published online, the name of the University of Pennsylvania president Amy Gutmann was misspelled. The 
error has been corrected in the HTML and PDF versions of the article. 


Erratum: Bring out your dead 


Nat. Biotechnol. 33, 1 (2015); published online 9 January 2015; corrected after print 9 January 2015 


In the version of this article initially published, the cost of bringing a drug to market was incorrectly stated as $2.3 billion. The correct amount, as 
per the Tufts study referenced in the article, is $2.56 billion. The error has been corrected in the HTML and PDF versions of the article. 


Corrigendum: Linking T-cell receptor sequence to functional phenotype at 
the single-cell level 


Arnold Han, Jacob Glanville, Leo Hansmann & Mark M Davis 
Nat. Biotechnol. 32, 684-692 (2014); published online 22 June 2014; corrected after print 14 January 2015 


In the version of this article initially published, the concentration of the V-region primers in the Online Methods section was given as 0.6 uM. 
The correct concentration is 0.06 uM. The error has been corrected in the HTML and PDF versions of the article. 


Corrigendum: Selling long life 


Christopher Thomas Scott & Laura DeFrancesco 
Nat. Biotechnol. 33, 31-40 (2015); published online 9 January 2015; corrected after print 14 January 2015 


In the version of this article initially published, the caption for Figure 5 read “Pope Innocent VIII, likely the first patient to undergo parabiosis.” In 
fact, he did not undergo parabiosis, but a blood transfusion. The caption should have read “Pope Innocent VIII died in a rejuvenation attempt in 
1492.” The errors have been corrected in the HTML and PDF versions of the article. 


Corrigendum: Status and market potential of transgenic biofortified crops 


Hans De Steur, Dieter Blancquaert, Simon Strobbe, Willy Lambert, Xavier Gellynck & Dominique Van Der Straeten 
Nat. Biotechnol. 33, 25-29 (2015); published online 9 January 2015; corrected after print 14 January 2015 


In the version of this article initially published, Figure 3a had three errors. The heights for bars ‘26’ and ‘31’ for US, Vitamin E, broccoli + tomato 
+ potato were ~7 and ~25, respectively; the transgenic (green) value US, vitamin E, tomato was given as ‘24.5’ but should be ‘40.3’. The errors have 
been corrected in the HTML and PDF versions of the article. 
COMMENT 


Is open innovation the way forward 
for big pharma? 


The current, fully integrated business model of large pharmaceutical companies is 
increasingly considered to be unsustainable, and so new approaches that engage large 
and small companies, governments and academic institutions are needed. Could ‘open 
innovation’ models that have proved successful in other sectors be fruitfully adopted 
by the pharmaceutical industry? 


In the past, the business model adopted by most high-tech 
companies was traditional ‘closed innovation’. Ideas were 
generated internally and taken from concept to commercialization 
using vertically integrated internal resources. 
Such closed organizations would guard all their intellectual 
property (IP) closely to protect company interests and 
to prevent exploitation by rivals. 

Recently, however, there have been challenges to the 
efficiency of this model as technology has developed and 
the external environment has changed. As well as people 
moving more freely between organizations, taking their 
knowledge and expertise with them, the growth of the 
internet and the pervasiveness of media generally have 
meant that information is much more widely available. 
Consequently, for many R&D challenges, there is now a 
wealth of knowledgeable people distributed around the 
globe with the potential expertise to address them, which 
would be impossible for any individual company to hire. 
Tapping in to this expertise is one of the key goals of ‘open 
innovation’. 


Examples of open innovation 
Open innovation assumes a flexible business model in 
which new product innovation originates from both internal 
and external ideas. Opportunities to source expertise 
and potential products externally for further development, 
often through collaborations, are actively embraced, 
and internal ideas are allowed to be exploited outside the 
originating organization. Following on from pioneering 
demonstrations of open source development in the software 
industry — leading to products such as the Linux 
operating system — other business sectors have adopted 
open innovation in varying ways. Although it is too early 
to say whether such strategies have been successful in 
most of these sectors, some advantages have already been 
demonstrated through different collaborative models. 
For example, after the consumer products company 
Procter and Gamble (P&G) reshaped its R&D model 
to harness external innovation in 2000, it reportedly 
increased its product success rate by 50% and the efficiency 
of R&D by 60%. Another example is the mining company 
Goldcorp, which successfully utilized ‘crowd sourcing’ by 
creating a competition in which anyone could access its 
geological data online to identify potential new seams of 
gold, with prizes awarded to participants who submitted 
the best methods and estimates. More than 8 million 
ounces of gold was found through the challenges, reducing 
expected exploration time by 2-3 years, and dramatically 
changing the fortune of the company. 


Open innovation in pharma 

The pharmaceutical industry is facing many of the same, 
if not greater, challenges as the other sectors that have 
adopted open innovation approaches. Increased costs 
and risks; a need to understand better patient and payer 
requirements; and opportunities to gain access to bet- 
ter tools, technologies and ideas, have all driven large 
pharmaceutical companies to re-evaluate their business 
models. At the same time, academic biomedicine is real- 
izing that there are tools, technologies and experience 
within pharmaceutical companies that can assist the 
development of their ideas and research towards clinical 
application. Consequently, there has been a shift in the 
cultural landscape that could allow a wider adoption of 
open innovation approaches. 

Indeed, in addition to the rapid growth of partnering 
activity in the past decade, large pharmaceutical compa- 
nies have already begun to work more actively together 
— as well as with small- and medium-sized enter- 
prises and academic institutions — on pre-competitive 
research. The success, in terms of numbers of projects 
funded and organizations involved, of the first call of 
the European Innovative Medicines Initiative 
(http://www.imi.europa.eu) is one example of this among the 
many other pre-competitive consortia globally. 

Several large companies have also been experimenting 
with more radical open innovation models. For example, 
in 2001, Eli Lilly spun out InnoCentive as the first 
internet problem-solving platform designed to connect 
companies with research challenges to potential solution 
providers. More recently, in 2009, Eli Lilly announced 
the Phenotypic Drug Discovery Initiative 
(https://pd2.lilly.com/pd2Web), which makes their assays and 
expertise available to academic institutions to source 
new collaborations and compounds, and Pfizer began 
allowing other organizations to screen against their 
internal compound library. With the aim of harnessing 
large-scale biological data and tools, Sage Bionetworks 
(http://sagebase.org/) was launched in 2009 to build 
complex, predictive models of disease using an open 
innovation model, initiated by comprehensive data 
and analysis tools donated by Merck. And in the field 
of neglected tropical diseases, GlaxoSmithKline (GSK) 
announced the creation of a patent pool, which aims to 
remove IP as a barrier to research into treatments for 
neglected diseases. GSK has put more than 800 patents 
for compounds or processes into this pool, and Alnylam 
has added a further 1,500 patents. 

Support for open innovation models in general has 
recently been indicated by GSK’s announcement that it 
is partnering to create a new bioscience park based on 
an open innovation model in which companies located 
on the park will have shared access to specialist skills, 
equipment and expertise. Johnson & Johnson's Head of 
Pharmaceutical Research & Development, Paul Stoffels, 
also announced a shift towards an open innovation model 
for the company in 2009. 


Misconceptions 

Although such initiatives demonstrate that support for 
open innovation in the pharmaceutical industry is becom- 
ing increasingly widespread, one barrier to the adoption of 
such models has been the major misconception that open 
innovation is equivalent to open access, and in some way 
undervalues or undermines the concept of IP protection. 
However, IP is actually the currency of open innovation, 
and open innovation relies on the fact that creating solely 
owned and derived IP does not automatically lead to 
success and the creation of commercial value. 

Indeed, across many industries, most patents remain 
uncommercialized; for example, Siemens and P&G 
recently reported that they only used 10% of their patent 
portfolio. So, there is a real opportunity to create value 
from the sharing of IP. If this IP is not managed actively, 
the asset value is not realized. This becomes especially 
crucial in an environment in which the window of oppor- 
tunity to recoup investment is short, as in pharmaceuticals. 
Proactive IP management looks to identify opportunities 
for sharing IP to create real value. 

Such co-creation involves sharing the costs and 
benefits of innovation, and resulting IP, in line with 
the relative contribution of the various parties. This 
would include royalties and commercialization rights, 
but here IP rights are ensured through appropriate and 
harmonized protection strategies that are agreed on by 
all participants. At the academic-industrial interface, 
there has been some progress in the use of ‘boilerplate’ 
agreements (for example, the Lambert Agreements in 
the UK) that aid discussion starting from a point that 
is appropriate to the particular collaborative situation 
(http://www.innovation.gov.uk/lambertagreements/). 
This is an approach that could be explored more widely 
for a range of sectors, including pharmaceuticals. 
Another major misinterpretation of open innovation 
is that all collaborations require direct cash from industry 
to be of value, but this need not be the case. For example, 
many academic researchers are as motivated by the tan- 
gible and intangible benefits gained from partnering with 
industry, such as access to tools, reagents and expertise. 
The use of in-kind contributions can allow risk sharing 
and access to diverse thinking without upfront cash con- 
tributions, to the benefit of all parties, and such efforts 
deserve further focus to fully realize their potential. 


Moving forward 

Adopting an open innovation ethos will require a change 
in the values placed by companies on new technologies 
and individual skill sets, as well as strong structural and 
cultural frameworks for optimal operation. New technol- 
ogy must be effectively deployed to maximize the benefits 
of open innovation; for example, the building of exper- 
tise networks and databases to allow the best partners 
to work with each other and dissemination of informa- 
tion about projects across an organization. Individuals 
should be employed who not only have relevant scien- 
tific expertise, but also possess strong external networks 
and are skilled in working with external organizations. 
Senior management must recognize that resources have 
to be applied to nurture collaborations and monitor their 
progress to ensure success. Employee roles may need to 
be redefined to ensure sufficient continuity with exter- 
nal organizations, and to ensure knowledge is being 
maintained within the enterprise. 

Overall, in our view, open innovation is a valuable 
model for large pharmaceutical companies. It provides 
considerable flexibility, aiding the rapid ramp-up or -down 
of activities in particular areas, which would help the 
industry to keep up with the rapid pace of change in the 
external research community. Furthermore, the industry 
would then be better able to face the rapidly increasing 
cost of bringing products to market through risk-sharing 
deals. Finally, such a model would allow pharmaceutical 
companies to become closer to understanding patients 
and their requirements, and the needs of other stake- 
holders. This will not only be beneficial to the phar- 
maceutical industry, but also to academic institutions, 
biotechnology companies and, ultimately, to patients. 


Jackie Hunter and Susie Stephens 
Precompetitive consortia in 
biomedicine—how are we doing? 


Barbara Mittleman, Garry Neil & Joel Cutcher-Gershenfeld 


Too few precompetitive consortia are being formed to mitigate lost opportunities and deliver on other potential 
mutual gains for public and private stakeholders in drug development. 


During the past decade, many drug companies 
entered the race to independently 
develop a small molecule inhibitor of p38 
kinase for inflammatory disease. All reached 
the same conclusion—the target is not viable 
owing to fundamental toxicity problems 
related to its biology. Tens of millions of dol- 
lars were invested and countless hours of effort 
were expended for researchers to come to this 
realization. Yet it all could have been so differ- 
ent. Collaboration around the target through 
a precompetitive consortium would have 
decreased the real expenditures and opportu- 
nity costs while retaining the option for each 
company to develop proprietary therapeu- 
tics, had the target shown promise. Here we 
argue that despite the formation of consortia 
as a complement to market competition and 
government regulation in recent years, too few 
exist to mitigate lost opportunities and deliver 
on other potential mutual gains for public and 
private stakeholders in the drug development 
enterprise. 


What are consortia good for? 

Consortia and cooperative arrangements, 
particularly precompetitive arrangements 
that generate shared resources such as data, 
tools and analytics, can help advance the drug 
development enterprise as costs increase, R&D 
spending is cut back, regulatory requirements 
become more stringent, public trust in the 
industry declines and company valuations 
shrink. At the core, too few new drugs and 
devices are making it to the market to alleviate 
the clinical need and to underwrite the spiraling 
costs of development. In fundamental 
ways, the processes, structures and cultures 
current in the industry are inadequate to meet 
the demands of biomedicine in the twenty-first 
century. 

[Figure: By pooling resources and expertise, precompetitive consortia enable progress in various drug discovery and development to move forward.] 

Shifting to a more sustainable drug development 
enterprise requires new models and 
methods for increasing efficiency and economy, 
decreasing and managing risk, stratifying 
patients and developing new tools and 
approaches. Broadly speaking, there are five 
strategic avenues for industry innovation: 
(i) independent operation of each stake- 
holder (a form of competition); (ii) mergers 
and acquisitions (M&As; a different form of 
competition); (iii) strategic alliances (a coop- 
erative model that is similar to but distinct 
from precompetitive consortia); (iv) open 
innovation approaches, such as crowdsourc- 
ing, prizes and challenges (another coopera- 
tive model that is similar to and distinct from 
precompetitive consortia), and (v) precom- 
petitive consortia (comprising, for example, 
industry, government, academia and public, 
patient or professional advocates). Consortia 
offer unique opportunities for stakeholders 
to redefine the precompetitive space, develop 
new work streams and jointly produce tools 
and resources. 

The synergies resulting from the consortium 
approach promise to save time and money by 
more efficiently and effectively generating ther- 
apeutics, diagnostics and devices to improve 
the public health. Although any avenue for 
innovation has the potential to deliver cost sav- 
ings, cooperative models increase the potential 
for risk sharing. The consortium model best 
fosters collaboration among stakeholders with 
diverse interests and experiences. Consortia, 
combined with the existing competitive institu- 
tional and regulatory arrangements, constitute 
a more robust institutional landscape for the 
entire drug development enterprise. 

Specific and demonstrable successes from 
biomedical consortia include the Biomarkers 
Consortium, the Predictive Safety and 
Toxicology Consortium, the Patient Reported 
Outcomes Consortium, the Genetics 
Association Information Network and the 
National Bone Health Alliance. These models 
exemplify collaboration in areas where indi- 
vidual companies were already active. But con- 
sortia can also encourage innovation. One such 
example of a cross-industry collaboration is 
TransCelerate BioPharma, which was formed 
not only to allow industry participants to col- 
laborate more efficiently with one another but 
also to promote interaction among an ‘ecosys- 
tem’ of participants, including the US Food 
and Drug Administration (FDA), academia 
and patient advocates. Initial projects of this 
consortium include defining clinical data 
standards, developing common standards for 
investigator training and certification, creat- 
ing a common portal for industry sponsored 
investigators and development of streamlined 
mechanisms to allow sponsors to obtain commercial 
comparator drugs for clinical trials. 
This is not just an instance of precompetitive 
cooperation, but an explicit effort to better 
align the institutional context. 

The success of precompetitive collaborations 
depends on a dynamic balance of coopera- 
tion, competition and regulation. At present, 
these cooperative aspects are underdeveloped, 
particularly in the United States. The use of 
this institutional arrangement is certainly 
more advanced in Europe. For example, the 
Innovative Medicines Initiative (IMI) repre- 
sents a commitment of more than €2 billion 
(US$2.7 billion) for multi-stakeholder col- 
laboration. But this scale has been achieved 
through a structure in which industry has a 
strong role in driving the agenda—a setup that 
is helpful for generating industry contributions 
but likely to be biased toward drug development 
and qualification rather than discovery 
and basic science. 

So if consortia have such potential for ben- 
efit, why is this type of innovation not sweep- 
ing the industry? Objections to consortia are 
many and include claims that consortia act as 
sinks for time and effort, are slow and burden- 
some and reveal the differences in worldview 
and priorities across sectors without solving 
those differences, and that historical norms of 
secrecy are insurmountable. With this in mind, 
in the following sections we assess the return 
on investment for consortia and attempt to 
identify which types of value this approach is 
most likely to achieve and at what cost. 


The anatomy of consortia 

The word ‘consortium’, derived from the Latin 
for fellowship, invokes joint goals, the synergy 
of shared action and the notion of an under- 
lying (negotiated) agreement. At its core, a 
consortium involves a new form of alignment 
among independent but interdependent stake- 
holders. More precisely, we define the stake- 
holder alignment needed for a consortium to 
succeed as the extent to which interdependent 
stakeholders orient and connect with one 
another to advance their separate and shared 
interests. 

Consortia provide mechanisms for stake- 
holders to ‘orient and connect’ with one 
another, and they deliver on both ‘separate 
and shared interests’. Furthermore, stakeholders 
can be aligned (or misaligned) along distinct 
dimensions. Alignment can be behavioral 
(information sharing, trust and so on), struc- 
tural (forums, incentives and so on), strategic 
(strategic intent and realized strategies) or 
cultural (underlying values and assumptions). 
In considering consortia, we consider all these 
dimensions of stakeholder alignment. 

Despite their underlying similarities of pur- 
pose, consortia differ markedly in the details of 
membership, operating policies and practices 
and the fates expected of the products of their 
activities. Which behavioral and structural dif- 
ferences are responsible for whether a consor- 
tium accomplishes its goals is an important and 
insufficiently understood question. 

The effective use of precompetitive consortia 
depends on a nuanced understanding of their 
behavior, structure, strategies and culture, as 
well as the availability of appropriate instru- 
ments and methods to monitor and evaluate 
them. Developing these takes time, though 
probably less than the usual 12-15 years 
needed to take a new compound to market. The 
time function, therefore, has two aspects: the 
time required to establish and operate a con- 
sortium to completion, and the time required 
to develop and apply assessment tools and to 
analyze the resulting data. An iterative cycle 
of assessment and improvement can then be 
instituted, optimizing the return on the invest- 
ment of time and resources. 

The development of a consortium requires 
the identification of a shared need and the 
complementary resources to be contributed by 
the participants, as well as the establishment of 
working rules that govern the shared activity 
and its products. The time required to do this 
is a function of factors, such as the specificity 
and concreteness of the task, the similarities 
or differences that animate the participants 
and the value of the products. The process of 
getting the Biomarkers Consortium to launch 
took approximately two years of negotiations, 
which focused on bridging the divergent stan- 
dards and practices of industry, government 
and academia with respect to, for example, 
intellectual property. Although each project 
needs a tailored IP agreement, underlying 
consortium-wide policies provide a common 
point of departure for such agreements. 

Coming to agreement regarding how the 
financing and resources will be arranged and 
overseen may be difficult. Most consortia in 
biomedicine depend on professional staff 
(which requires central funding) and addi- 
tional resources to be targeted for selected 
projects. Aligning incentives—financial and 
nonfinancial—is key and requires an under- 
standing of the reward systems in place for 
each of the participants. Consortium structures 
must ensure that each party receives the nec- 
essary individual benefits while accomplish- 
ing collective objectives, or the consortium 
is doomed to fail. Poorly aligned incentives 
cause failures, as they do not provide suffi- 
cient impetus to engage partners and elicit the 
commitment of necessary funds as competing 
priorities, opportunities and needs arise. 

Likewise, investment in a consortium 
usually represents a very small fraction of 
the costs of a drug development program. 
Because the vast majority of drug devel- 
opment efforts fail in an expensive way— 
whether early or late—and the failure of a 
consortium is relatively inexpensive, experi- 
mentation in consortium development may 
well be cost effective. Cost effectiveness, in 
this context, relates to the potential value of 
the consortium process and products versus 
the value of doing it alone. Embedded in this 
notion of consortium value is the deeper 
notion of the contrast between proprietary 
and precompetitive aspects of the drug 
development value chain. Collaboration, in 
the framework of a consortium or other part- 
nership, implies that sharing resources, risks 
and products can be mutually beneficial. In 
Box 1 Assessing the Biomarkers Consortium 


After six years in operation, does the Biomarkers Consortium represent a good return on 
investment for its members? The consortium has launched 15 projects in areas such as 
Alzheimer’s disease, cardiovascular disease and breast cancer, with several more projects 
now under consideration. The consortium’s first project, ‘The utility of adiponectin as a 
biomarker predictive of glycemic efficacy’, was completed in 2009 (ref. 18); one other 
project has now been completed, and two more are slated for completion this year. 
Through the consortium, industry has been able to constructively engage with the 
FDA on a precompetitive basis and understand the agency’s thinking about biomarker 
qualification as a tool for drug development—opportunities that are useful for projects 
from the consortium as well as for other activities. At the same time, no qualified 
biomarker developed through the Biomarkers Consortium has yet delivered tangible results 
to patients or other stakeholders. Is this what must happen if continued engagement is to 
be expected in the consortium? 


contrast with a joint venture or strategic alli- 
ance, a precompetitive consortium involves 
more open sharing, a risk that is balanced 
against the potential benefit of inputs from 
multiple stakeholders. At the other end of the 
cooperative spectrum, open innovation, such 
as crowdsourcing, involves more transpar- 
ency and proportionally greater risk, coun- 
terbalanced by the potential for more diverse 
and unexpected inputs. In these senses, pre- 
competitive consortia represent a middle 
path between risk and benefit. 

Precompetitive sharing of tools and 
platforms can facilitate comparison across 
industry and academia, facilitate regulators’ 
abilities to make data-driven assessments and 
clarify regulatory principles without impinging 
on the business model of big pharma. 
Similarly, some argue that target sharing 
should also fall into the precompetitive 
space, allowing cost and risk sharing early in 
the development process without interfering 
with later-phase proprietary development of 
leads, as is currently done. As noted above, 
p38 kinase target programs initiated by many 
companies hit a dead end that represented 
a massive loss of resources to biomedicine 
as a whole—in the direct costs as well as in 
opportunity costs from other, more viable 
programs not being pursued. 

As demonstrated by the Biomarkers 
Consortium (see Box 1), however, managing 
expectations, understanding the real poten- 
tial and limitations of consortia and carefully 
monitoring and evaluating progress and out- 
comes are key to understanding the place of 
consortia in the complex ecosystem of drug 
development. Ultimately, the fundamental 
question for all stakeholders in drug devel- 
opment is whether the continued, current 
independent patterns of interaction will be 
sufficient to sustain the enterprise in the 
future, or whether a rebalancing of compe- 
tition and collaboration is needed. 
Surmounting drug development’s 
drawbacks 

The inefficiencies and high attrition rates of 
current drug development are well known. 
These challenges arise on several levels. 


Structural and strategic challenges. Not only 
is the overall process for drug development in 
crisis, but there is also ambiguity in the overall 
strategy. Niche drugs with stable but limited 
markets are often discarded in favor of poten- 
tial blockbusters targeting common conditions. 
Focusing on disease entities rather than under- 
lying pathobiological pathways dictates clinical 
trial designs that fail to account for heteroge- 
neity among patients. The lack of biomarkers 
and other patient-stratification tools prevents 
optimization of trials to avoid toxicity and 
maximize efficacy. Regulatory decision making 
would be facilitated and improved by the availability 
of data and qualified tools. The barriers 
arise from limited understanding of the biology 
and a lack of validated targets and adequate 
translational tools. Such understanding does not 
provide competitive advantages, and failing to 
share it creates major practical disadvantages. 

Pooling of resources, data, tools and plat- 
forms—and sharing of the resulting products— 
offers a way to streamline drug development. 
Consortium arrangements allow the harmoni- 
zation of tools and markers developed through 
shared efforts and shared in a precompetitive 
manner, thereby providing new standards and 
regulatory benchmarks that are informed by 
science, technically validated and open for 
qualification for use. Consortia also offer 
the opportunity to explore new clinical trial 
approaches, such as focusing efforts through 
adaptive designs, stratifying patients according to 
risk or markers, testing across disease boundaries 
according to implicated molecular pathways 
and/or interrogating data sets for new hypotheses 
and signals. Thus, consortia offer opportunities 
for resource leverage, precompetitive assay 
development, knowledge sharing, target validation 
and so on. Such sharing, in turn, enables 
and promotes competition by allowing more 
and higher-quality molecules to be developed 
by companies that will compete using these 
new drugs in the marketplace. 

The paradox that precompetitive sharing 
promotes, rather than impedes, competition 
depends on carefully defining the crux of the 
competitive business model for the industry. 
Pharmaceutical companies generate value and 
income through the development and market- 
ing of medicines. Cost centers that siphon off 
resources reduce overall profits, and reductions 
in such costs improve the return to investors. 
Identifying cost centers that can be reduced 
by cost sharing provides value and increases 
return on investment. 

Designing such cost-saving and risk-sharing 
arrangements depends on engaging the appro- 
priate partners in consortia. From a structural 
standpoint, the drug development ecosystem 
can be said to comprise six major types of 
stakeholders: (i) pharma; (ii) biotech (mainly 
small and medium-sized companies (SMEs)); 
(iii) academia; (iv) government labs (such as 
the US National Institutes of Health (NIH)); 
(v) government regulators (such as the FDA), 
and (vi) patients, non-profit organizations 
and civil society. Each stakeholder is driven 
by distinct motives and incentives, answers 
to distinct authorities and has individual met- 
rics for progress and success. Common to all, 
however, is the goal of improving health and 
well-being for patients. Major changes, at 
increasing pace, are occurring within each 
stakeholder community, and these changes 
increase instability in the ecosystem and pro- 
vide pressure and opportunity for cross-sector 
collaborations. New forms of behavioral, struc- 
tural and cultural alignment—in which adjust- 
ments are made by all stakeholders, taking into 
account their separate and shared interests— 
will be needed. 

The dynamic structural landscape is charac- 
terized by M&As that have reduced the num- 
ber of large pharma companies by nearly half. 
Furthermore, cutbacks in pharmaceutical R&D 
funding, dissolution of discovery groups and 
shuttering of facilities are common following 
M&As. Many small biotech companies have 
changed their business models, shifting from 
fully integrated pharmaceutical companies to 
simply targets for purchase by a larger com- 
pany (preferably a pharmaceutical company) 
through advancing an area, target or molecule 
to the point where the larger organization can 
take it through the clinic (thereby substituting 
the smaller company for in-house R&D capa- 
bilities). Thus, new drugs originate increas- 
ingly, albeit inefficiently and unpredictably, 
in smaller enterprises. The lack of efficiency 
and predictability is manifest as an inadequate 
return on investment and provides justification 
for the reduction in R&D both within compa- 
nies and after M&As. Academic stakeholders 
are also in a state of flux, with public universi- 
ties facing funding crises and organizational 
challenges to facilitating interdisciplinary 
work. Traditional tenure and promotion cri- 
teria do not fully recognize the value of team 
science and of crossing departmental bound- 
aries. Finally, government labs and regulatory 
authorities face the cross-currents of increas- 
ingly partisan politics and fundamental debates 
over the level and nature of government spend- 
ing on many aspects of health. 

Neither universities nor government is con- 
figured to develop, test and market medicines, 
and it is not clear that these activities are con- 
sistent with the mission of either. The academic 
mission to educate and to generate knowledge 
has important contributions to make to drug 
development, as do the scientific and regula- 
tory missions of government agencies, but in 
neither case does that directly involve the busi- 
ness of medicines. Patient and public advocacy 
organizations and other mission-driven non- 
profits are playing an important part in filling 
gaps not fully served by market forces, but this 
is an idiosyncratic process that depends on the
emergence of leadership in a given disease area.
Despite the emergence of venture philanthropy, 
patients are poorly represented in the drug 
development decision-making process overall. 
A further limiting factor is the availability of 
patients to enroll in clinical trials and to serve 
as a source of specimens and data. This is par- 
ticularly true in circumstances where a disease 
often occurs with independent but co-morbid 
conditions, causing numerous exclusions from 
entry into clinical trials, and/or where patient 
populations are small. 

The present misalignment of organizational 
or sector goals and incentives, however, con- 
strains the ability of the ecosystem to acceler- 
ate the delivery of cost-effective solutions for 
health problems. These are the opportunities 
where consortia can make a difference. 


Cultural challenges. Structural and strategic
challenges that make alignment of parties and
incentives difficult are accompanied by
deep cultural challenges¹¹. Secrecy and privacy
as operating principles are deeply ingrained in
the pharmaceutical industry; they are designed to
protect market advantage by hiding the identity
of targets of interest and the clinical and
therapeutic areas under investigation. Even sharing
between preclinical and clinical teams within
the company is unusual in many organizations.


Box 2 Elements of effective, multi-stakeholder consortia

Although there is no one-size-fits-all solution, several common elements are shared by
successful and effective consortia. These are:

• Shared goals and vision. A clear statement of the consortium’s goals and overall vision
for success (including a shared understanding of what these words mean).

• Engaged stakeholders. A definition of a governance structure and rules for
decision-making.

• Leadership and trust. Leaders able to be effective on the basis of influence, rather than
authority, with trust among principals providing a necessary foundation as policies and
norms develop.

• Roles, responsibilities and rights. Well-defined roles, responsibilities and rights for all
participants, typically in the form of a charter, with monitoring, learning and
conflict-resolution mechanisms that enable necessary adjustments over time.

• Policies, protocols and standards. Well-articulated policies, protocols and standards
governing the workings of the consortium and how the participants relate to one another
(for example, in the areas of confidentiality, conflicts of interest, intellectual property,
consortium grants and contracts, data access and data sharing, finances and flow of
resources).

• Milestones and benchmarks. Benchmarking and communication processes to ensure
that all parties (participants, stakeholders and beneficiaries, including regulators,
clinicians, patient groups and advocates, the public, and so on) have an appropriate
understanding of the risks and accomplishments.

• Exit and transformation plan. A built-in capacity for transformation, should fundamental
changes in strategy, structure and process be necessary. An ‘exit plan’ should describe
the conclusion of the consortium either when goals have been met or when it is
determined that goals, milestones or benchmarks cannot be met.


There are, of course, major cultural differences
among and within companies, owing in part to 
the differences in nature between small, startup 
organizations and large, venerable, complex 
and isolated organizations with established 
ways of acting. 

Of course, industry is not the only sector 
that is culturally diverse. Within government, 
cultural differences exist among the FDA,
the NIH and the Centers for Medicare &
Medicaid Services, as a result of the agencies’
diverse missions as well as the training, experience
and habits of the people working there¹².
Within academia, there are cultural differences 
between institutions regarding attitudes toward 
technology transfer and the relative values of 
translational research and basic science. 

Perhaps the deepest cultural challenge 
involves the lack of understanding in the 
general population of the risk that accompa- 
nies all therapies. Finally, important distinc- 
tions and deeply rooted cultural assumptions 
about health and disease are in the back- 
ground of each sector and individual within 
the system. 

Having drawn these broad characterizations, 
it is also clear that no sector or organization 
is monolithic; distinct (micro-) cultures exist 
within many or all companies, agencies and 
organizations. Aligning incentives, promoting
effective and substantive communication and
developing viable shared practices depend on
a sensitivity to the cultural bases of behavior,
expectations and willingness to assume risk or
uncertainty.


A proposal for wider adoption of consortia 
We propose that, if cross-sector consortia are 
used judiciously, drug development writ large 
can more effectively promote the broad needs 
of patients and society, competition and the 
profit motive can drive innovation in the 
marketplace, regulation will enforce public 
health and safety standards, precompetitive 
collaboration will reduce waste and minimize 
risk, and collaborative approaches to science 
will open up new frontiers. Elements of this 
vision currently motivate the drug develop- 
ment enterprise at all levels. Missing, how- 
ever, are mechanisms that align and orient 
the stakeholders so that it is possible for 
them to function together in a coherent and 
efficient system. Ad hoc approaches to drive 
change have been undertaken by subsets of 
stakeholders, including the NIH Roadmap, 
the FDA Critical Path Initiative and the 
European Commission’s IMI. Academic
and commercially sponsored conferences
addressing crucial issues (such as the intersection
of disease and wellness, the handling
of ‘small n’ diseases, the boundary between
the precompetitive and the competitive, and
the best processes for road mapping) have
been mounted, but only by subsets of
stakeholders.

A skillful mix of collaboration, competition 
and regulation based on both lateral and cross- 
domain alignment of stakeholders can lead to 
the efficient and effective development of con- 
sortia, which, coupled with well-managed and 
reasonable expectations, could define the right- 
ful place of this approach within the repertoire 
of drug development tools and approaches. 
Optimizing consortium structure and ensuring 
its ‘goodness-of-fit’ to the activities and goals of 
the consortium are crucial elements of success. 

No single structure or set of terms is universally
useful¹³. However, all successful and effective
consortia share the following components
(see Box 2): (i) well-defined shared goals and
objectives, (ii) well-specified governance and 
decision-making responsibilities, (iii) clearly 
articulated roles, responsibilities, contributions 
and returns of the partners, (iv) appropriate 
policies defining, for example, antitrust, pub- 
lication and IP that are spelled out at the outset 
and adapted through experience, (v) leader- 
ship based on influence more than authority, 
and (vi) mechanisms for shared learning and 
continuous improvement. Further policies 
may also be needed and should be customized 
to the specific partnership or consortium. 
Finally, sufficiently specific and well-defined 
milestones and benchmarks permit concise 
understanding of a consortium’s ‘end points’ 
and allow allocation of the correct amount of 
capital for a dedicated problem. Once capital 
allocation and milestones are defined, risk is 
lowered and expectation of success is inher- 
ently more realistic. The advantages of con- 
sortia must therefore be understood from two 
distinct vantage points: that of strategy and that 
of realizing operational efficiencies. 

Criticisms of consortia are generally based 
on the long time required for them to generate 
results, their excessive bureaucracy, the risks 
in sharing information, and specific regulatory 
barriers around certain forms of collaboration. 
But pooling resources has been shown to has- 
ten the availability of data and tools, result- 
ing in the availability of better biomarkers, 
improved clinical trial designs, expanded data 
sets and other advances. Likewise, the ability 
of regulators to access consensually validated
and broadly available tools is valuable in
facilitating effective regulatory decision making. The
decision to develop a consortium rests on
balancing the strategic and operational benefits
to be gained against the risks in time and resources
and the likelihood of success. The goals and
the operational structures developed to
accomplish them can be translated into benchmarks
and metrics. Monitoring the progress of the
consortium toward its goals can also permit the 
timely dissolution of arrangements that are not 
succeeding, allow necessary course corrections 
and point to improvements in structure and/or 
functioning of the partnership. 

The subject of when and how to use the consortium
approach and how to assess its success
is itself ripe for scientific investigation¹⁵. Such
a ‘science-of-science’ approach is founded on
underlying theory-based principles, including
the notion of internal and lateral alignment of
parties in negotiation with one another¹⁶,¹⁷.
Major pharma companies are complex and
internally diverse organizations, as are government
agencies and universities, and
internal alignment and even internal communication
can be very challenging and not
culturally expected. Coordination within companies
to determine when it is better to operate
independently and when to work in partnership
with others is relatively new. Some major
pharmaceutical companies, such as Johnson
& Johnson, now have internal forums in
which company representatives involved in
different consortia compare notes and develop
coordinated strategies for engagement with
consortia¹⁴.

The institutional arrangements associated 
with consortia are still in the early stages of 
development compared with the infrastruc- 
ture associated with M&As. Consortia repre- 
sent a contrast with the established systems for 
identifying and pursuing acquisition targets as 
a source of innovation. Consortia effectively 
round up all the potential buyers (pharma) 
of a technology and can give them access to 
the technology at ‘bargain-basement’ rates, 
with some development funding as the price 
of admission. The result is that what could be 
a market-driven, inter-company, competitive 
bidding process for a technology asset may 
become, through a consortium, a one-time 
sale to a pharma buyers’ club spanning much of 
the market. A small company in a consortium 
risks having its IP accessed by all its potential 
customers but gains an inside track for visibil- 
ity. And the risks of not participating may be 
worse—the consortium might fund an alterna- 
tive route or breakthrough. It is possible that in 
the future, investors may even hold it against 
companies that are not in on the ground floor 
with consortia that are in an appropriate stra- 
tegic space, just as they now will hold it against 
a firm (large or small) that is not managing the 
M&A space well. 

Consortia are not yet well institutionalized
across the industry or among other stakeholders.
By virtue of their diverse membership,
divergent underlying cultures and varying business
structures and practices, they are dynamic
arrangements requiring alignment within each
stakeholder organization as well as among the 
stakeholder organizations themselves. When 
linear assumptions of cause and effect are 
applied to complex systems, the outcomes may 
be unanticipated, especially if little attention 
has been given to mechanisms for feedback and 
adjustment. Clear structures and policies for 
shared activities, as well as defined benchmarks 
and strategies for change, provide a framework 
for consortium success and for understanding 
the forces that govern that success. 

Achieving lateral alignment (among stake- 
holders) is a process that may be quite unfa- 
miliar to parties accustomed to working within 
hierarchical structures. Consortia are charac- 
terized by lateral and multi-layered relation- 
ships, requiring more influence and problem 
solving than authority and direction—skills 
that can vary considerably across consortium 
participants. Clarity in structure and shared 
vision regarding the goals and manner of inter- 
action among the participants serve to provide 
common working rules and to promote sus- 
tained alignment across stakeholder groups, 
thereby increasing the ability of the consortium 
to accomplish its goals and to satisfy its partici- 
pants. Maintaining consistent vision and goals, 
trust relationships and leadership can be chal- 
lenging given the relatively high rate of person- 
nel turnover or change of roles within industry, 
as well as in other stakeholder sectors. 


Challenges of consortia 

Better metrics and measurement are needed to 
assess whether biomedical consortia are pro- 
gressing toward their goals and delivering value 
to their participants. Individual stakeholders 
will pull back if they do not see their interests 
advanced, yet if they are focused only on self- 
interest, the consortium will not be sustain- 
able. Thus consortia need to deliver on a mix 
of individual and collective interests among 
diverse stakeholders. Table 1 presents an ini- 
tial schema of parameters and metrics that can 
assist in the monitoring of a consortium and 
help to provide guidance for course corrections 
and/or disbanding the consortium when goals 
are met or when it becomes apparent that they 
will not be met. 

Table 1 Benefits, risks and relevant metrics for consortia

Goals and objectives
• Prevent redundancy and waste
• Provide access and synergy
• Shared cost
• Shared risk
• Networking, contact and information
• Transparency

Benefits
• Reduction of cost
• Enabling of diverse inputs
• Mitigation of risk of parallel dead ends
• Ability to ‘fail early, fail cheap’
• Access to new tools, samples, platforms, patients and so on
• Facilitation of work that could not be done by any one stakeholder alone
• Identification of combined opportunities as a result of diverse interests
• Reduction of front-end expenditures
• Full benefit at fractional cost
• Shared responsibility
• Opportunity to influence risk profile
• Engagement of new parties in the work
• Acquisition of useful information and exchange insights
• Chance to convey useful perspectives and influence others
• Reduction of traditional antitrust liability risk
• Improved public trust
• Ease of joining and participating
• Knowledge of risks, benefits, outcomes, milestones and metrics
• Greater visibility of perspectives of less powerful stakeholders (particularly under
consensus decision-making rules)

Risks
• Expenditure of time and effort to reach agreement or consensus
• Unwieldiness and time costs of bureaucracy and operational processes
• Differences among stakeholders in willingness and timing to kill projects
• Increase in risk of competitors gaining insight into internal strategies and plans
• Driving of increased sharing of resources and information by expectations of reciprocity
• Cost of participation in personnel costs and other infrastructure costs
• Reduction of value owing to shared benefit
• Diffuse credit
• Unpredictable sharing of blame
• New form of antitrust liability possible
• Confidentiality risks
• Changes in market share/prospects
• Information exchange that benefits some stakeholders more than others
• Reduced exclusivity
• Decreased flexibility and agility as a result of increased structure and operational
predictability

Sample protocols and metrics
• Assess stakeholder alignment, both internal and external
• Determine lead time to initiation of activities
• Assess progress by setting milestones
• Model opportunity costs of participation versus those of nonparticipation
• Measure access and cost of access
• Model strategic risk/benefit ratio
• Establish mechanisms for ‘modular’ participation
• Measure actual costs
• Model benefit/cost ratio
• Model risk/benefit ratio
• Assess and/or model market prospects
• Assess and/or model stakeholder returns on investment
• Assess changes in regulatory policy, market forces, public opinion and so on
• Measure impact within stakeholder groups
• Measure ‘organizational drag’ imposed by consortium rules and operations


There are challenges apart from those
of establishing the needed constituents,
resources, structures and operations and of
setting up the appropriate monitoring systems
to ensure timely accomplishment of
benchmarks, milestones and metrics, but
they are not as easily quantified, assessed or
rectified. Internal alignment within member
organizations occurs outside of the view
of the consortium but is crucial to the joint
venture’s success¹⁵,¹⁸. It is important that
members assign representatives who have
sufficient gravitas within their own organizations
and the authority to commit the
organization to the consortium. These representatives
face predictable dilemmas: they
may be accused in their own organizations of
having ‘forgotten where they came from’ and
having ‘gone native’ while being accused in
the consortium of not moving fast enough to
ensure progress.

Trust among the members is key, particularly
at pivotal moments in the life of the consortium
such as its formation, but it is difficult
to force and measure. Without fundamental
trust, commitment to persevere through points
of disagreement and misaligned goals can be
tentative or insufficient. Change in member
personnel is an example of a pivotal event in
the operation of a consortium with potential
to undermine continuity, trust, efficiency and
shared vision. Changes of personnel occur frequently
in industry and other sectors, so this
is a predictable challenge. Consortium policies
and practices must be written to accommodate
communication between leaders and members
and among members as a means not only to
exchange information but also to embody the
commitment to sharing, trust, transparency
and maintaining a common vision.

Precompetitive sharing should provide benefit
to all of the participants and often to the
public and nonparticipants as well. The motivation
and ability of a stakeholder to join such
an effort depend on several factors: whether
consortium goals fall outside of or are tangential
to the central business model or revenue
center of the organization; whether the
organization can tolerate devoting manpower,
energy and possibly monetary and/or tangible
resources to the effort; and whether the potential
benefits of sharing exceed the risks. Of
course, not all stakeholders can engage equally
in precompetitive consortia.

Biomarkers, for instance, are precompetitive
tools for the pharmaceutical industry, whose
profit model centers around the registration
and marketing of therapeutics. For a small biotech
or device company, however, the discovery,
development, qualification and marketing
of biomarker tests is not precompetitive and
may be the center of its financial model.
Likewise, information sharing may prove much
riskier for an SME with a limited portfolio of IP
and know-how. On the other hand, consortia
offer smaller enterprises a greater chance that
their perspectives will be seen and valued by 
regulators and other leaders. Transparency and 
open sharing of information can markedly alter 
the usual licensing and sales process, either 
undermining or enhancing the ability of the 
SME seller to market itself and its product(s). 
Protecting the competitive edge and promoting 
sharing as a means to accomplish efficiencies 
of scale and synergies from bringing diverse 
resources together are counterbalanced, then, 
by the needs of academics to maintain their 
exclusive roles as experts, inventors and dis- 
coverers of knowledge; of SMEs to thrive in 
the market ecosystem (by bringing a product 
to market or by selling themselves as a product 
in the market); and of large corporations and 
SMEs to sustain a tenable business plan. 


Concluding remarks 

Factors challenging drug development, such 
as rising costs, increasing failure rates and 
regulatory uncertainty—set in a climate of 
global economic strain and increasing pres- 
sures on the costs of health care—are pushing 
the pharmaceutical industry to become leaner,
more efficient and more effective in delivering
products to the public. The present system is
not working. Among available options, pre- 
competitive consortia, with defined goals and 
structures that align the incentives of all stake- 
holders, provide an option that can be strate- 
gic and low cost to individual stakeholders and 
can facilitate the delivery of shared tools and 
approaches to reduce both cost and risk. 

Ultimately, any system has four basic ele- 
ments: inputs, processes, outputs and feed- 
back. In the drug development ecosystem, 
there is substantial variability in all these 
elements. The building and examining of the 
role of consortia is an opportunity for a rebal- 
ancing of the competitive, collaborative and 
regulatory aspects of the system. Competition 
should be sustained in domains where it con- 
tinues to drive innovation and serve society. In 
those where efforts are being duplicated, and 
resources are being wasted, precompetitive 
consortia and other mechanisms may produce 
better outcomes for the respective stakeholders
and the system as a whole. Consortia provide
regulators and regulated parties more oppor- 
tunities for mutual learning, which serves the 
overall public interest. 

Consortia are just one tool to address systemic
problems in biomedicine. Here, we
have presented a proposal for how to increase
the utilization and effectiveness of consortia. 
In fact, the biomedical enterprise will ulti- 
mately need to be realigned to achieve a more 
dynamic mix of competition, collaboration 
and regulation if it is to continue to deliver 
on the promise of improved societal health 
through the acceleration of change in science, 
technology and society.